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METHODS AND SYSTEMS FOR DYNAMIC GENE EXPRESSION PROFILING 

FIELD OF THE INVENTION 

The invention is related to transcriptional profiling technology. 

5 BACKGROUND 

The introduction of genomics has been instrumental in accelerating the pace of dmg 
discovery. The genomic technologies have proved their value in finding novel drug targets. 
Further improvement in this area will provide more efficient tools resulting in faster and more 
cost efficient development of potential drugs. 

10 The drag discovery process includes several steps: the identification of a potential 

biochemical target associated with disease, screening for active compounds and further chemical 
design, preclinical tests, and finally cUnical trials. The efficiency of this process is still far fi:om 
perfect: it is estimated that about 75% of money spent in the R&D process went to fund failed 
projects. Moreover, the later in the product development a failure occurs, the bigger are losses 

15 associated with this project Therefore, there is a need for early elimination of future failures to 
consid^ably cut costs of the whole drug development process. Thus, the quality of the original 
molecular target becomes a decisive factor for cost-effective dmg development 

One approach that promises to impact on the process of target identification and 
validation is transcription profiling. This method compares expression of genes in a specific 

20 situation: for example, between disease and normal cells, between control and drag-treated cells 
or between cells responding to treatment and those resistant to it. The information generated by 
this approach may directly identify specific genes to be targeted by a ther^y, and, importantly, 
reveals biochemical pathways involved in disease and treatment. In brief, it not only provides 
biochemical targets, but at the same time, a way to assess the quality of these targets. Moreover, 

25 in combination with cell-based screening, transcription profiling is positioned to dramatically 
change the field of drag discovery. Historically, screwing for a potential drag was successfully 
performed using phenotypic change as a maiker in functional cellular systern. For example, 
growth of tuinor cells in cidture was monitored to identify anticancer drags. Similarly, bacterial 
viability was used in assays aimed at identifying antibiotic compoimds. Such screens were 

30 typically conducted without prior knowledge of the targeted biochemical pathway. In fact, the 
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identified effective compounds revealed such pathways and pointed out the true molecular target, 

enabling subsequent rational design of the next generations of dmgs. 

Modem tools of transcription profiling can be used to design novel screening methods 
that will utilize gene expression in place of phenotypic changes to assess effectiveness of a drug. 

5 For example, these methods are described in U.S. Patent Nbs. 5,262,3 1 1 ; 5,665,547; 5,599,672; 
5,580,726; 6,045,988 arid 5,994,076, as well as Luehrsen et al. (1997, Biotechniques, 22:168-74; 
Liang and Pardee (1998, Mol Biotechnol. 10:261-7). Suph approach will be invaluable for drug 
discovery in the field of central nervous system (CNS) disorders such as dementia, mild 
cognitive impairment, depression, etc., where phenotypic screening is in^plicable, but desired 

10 transcription profile can be readily established and linked to particular disorders. Once again, the 
identified effective compounds will reveal the underlying molecular processes. In addition, this 
method can be instrumental for development of improved versions of existent dmgs, which act at 
several biochemical targets at the same time to generate the desired pharmacological effect. In 
such case the change in the transcriptional response may be a better marker for drag action than 

1 5 selection based on optimization of binding to multiple targets. 

Prior to the instant invention, the most advanced method of transcription profiling is 
based on technology using DNA nodcroarrays, for example, as reviewed in Greenberg, 2001 
Neurology 57:755-61; Wu, 2001, J Pathol. 195:53-65; Dhiman et al., 2001, Vaccine 20:22-30; 
Bier et al., 2001 Fresenius J Anal Chem. 371:151-6; Mills et al., 2001, Nat Cell Biol. 3:E175-8; 

20 and as described in U.S. Patent Nos. 5,593,839; 5,837,832; 5,856,101; 6,203,989; 6,271,957; and 
6,287,778* DNA microarray is a method wiiich perfomais simultaneous comparison of the 
expression of several tiiousand genes in a given sample by assessing hybridization of the labeled 
polynucleotide samples, obtained by reverse transcription of mKNAs, to the DNA molecules 
^ attached to the surface of the test array. While the technology provides valuable informatibn 

. 25 about transcriptional changes, it is far firom perfect. 

First of all, this technology is limited to the pool of genes presented in the microarray. 
The current printing n&thods allows placement of 10,000-1 5,000 genes on a single chip, which 
. is essentially a number of genes expressed in a particular cell type. Gfiven the diversity of cell 
types, it requires deverdpment of specific arrays for specific cell types. While flieoretically 
30 possible, this task is hard to acheive, since it requires knowledge about gene pool expressed in 
these cells prior to microarray manufacturing. 
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, Moreover, the number of transcripts in a tissue sample is even higher than in a cellular 

sample and exceeds the current capacity of the microarray. In addition, some changes in gene 

expression result from altemative splicing, which further increases the number of transcripts that 

need to be assessed. The only possibility to overcome these diflBculties will be to develop 

5 multiple arrays that will cover the entire genome, including alternatively spliced genes. This 

approach will significantly increase the cost of a single experiment and will require a large 

biological sample, perhaps larger than is reasonably available. 

Secondly, at present, DNA microarrays do not provide quantitatively accurate data, and 
observed changes in gene expression have to be confirmed by an independent methods, for 
10 example, quantitative PGR (Q-PCR). 

In addition,. a typical microairay experiment includes several manual steps which affect 
the reproducibility of fliis method. 

And finally, the expression of rare transcripts, which may be of particxilar interest, can 
not be accurately measured by microarrays using current detection techniques. These limitations 
15 demonstrate a need to develop altemative methods to perform transcription profiling, preferably 
one that I) will not require prior knowledge of the sequences of the expressed gene pool before 
the assay but by itself will provide this information during/after the assay; 2) will measure 
quantitative changes in the level of expressed transcripts; 3) will be able to detect expression of 
rare genes; and 4) can be automated. 

20 Capillary electrophoresis has been used to quantitatively detect gene expression. Rajevic 

at el. (2001, Pflugers Arch. 442(6 Suppl l):R190-2) discloses a method for detecting differential 
expression of oncogenes by using seven pairs of primers for detecting the differences in 
expression of a nmnber of oncogenes simultaneously. Sense primers were 5' end-labelled witihi a 
fluorescent dye. Multiplex fluorescent RT-PCR results were analyzed by c^illary 

25 electrophoresis on ABI-PRISM 310 Genetic Analyzer. Borson et al. (1998, Biotechniques 

25:130-7) describes a strategy for dependable quantitation of low-abundance mRNA transcripts 
based on quantitative competitive reverse transcription PGR (QC-RT-PCR) coupled to capillary 
electrophoresis (CE) for rapid separation and detection of products. George et al,, (1997, J 
Chromatogr B Biomed Sci Appl 695:93-102) describes the application of a capillaiy . 

30 electrophoresis system (ABI 310) to the identification of fluoresc^it differential display 
generated EST patterns. Odin et al. (1999, J Chromatogr B Biomed Sci Appl 734:47-53) 

3 
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describes an automated capillary gel electrophoresis with multicolor detection for separation and 
quantification of PCR-amplified cDNA. 

Omori et al. (2000, Genomics 67:140-5) measures and compares the amount of 
commercially purchased a-globin mRNA by competitive PGR in two independently reverse 
transcribed cDNA samples using oligo(dT) or oligo(dU) primers. The oUgo(dT) or oligo(dU) 
primers share a 3' oligo^dT) or oligo(dU) sequence and a 5* common sequence. In addition the 
oUgo(dT) or oligo(dU) primer for each sample also contains a unique 29 nucleotide sequence 
between the 3' ohgo(dT) or oligo(dU) sequence and the 5' common sequence. After the 
synthesis of first strand cDNA, PGR is perfomied to amplify the cDNA using a gene-specific 
primer and a primer complementary to the common sequence which is labeled with a unique 
label. The amplified PGR products are then analyzed by spotting onto a detection plate of a 
fluorescence scanner. ^ 

There is a need ia the art for simple, sensitive method for simultaneous quantitative 
detection of gene expression profile in multiple samples. 

SUMMARY OP THE INVENTION 

The invention provides methods and compositions for expression profiling of two of 
more samples. 

The invention provides a method for comparing gene egression profiles of two or more 
samples, the method comprising: 

(a) synthesiziag a pluraUty of first strand cDNAs firom a first sample using a 
. first oligonucleotide primer comprising a sample-specific sequence tag, where the sample- 
specific sequence tagis GG rich at its 5' teraiinal and At rich at its 3 ' terminal; 

: (b) selectively amplifymg at least a subset of the cDNA so as to generate one 
or more sample-specific amplified products; 

(c) . -detecting the abundance of one or more the sample-specific amplified 
products, where the abundance determines an expression profile of one or more genes in the first 
sample; and 

-i 
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(d) comparing the expression profile of the one or more genes in the nrst 

sample with an expression profile of the one or more genes in a second sample, where a 

diiierence in the expression profile indicates differential expression of the one or more genes in 

the two samples. 

5 The invention also provides a method for comparing gene expression profiles of two or 

more samples, the method comprising: 

(a) synthesizing a plurality of first strand cDNAs firom a first sample using a 
first oligonucleotide primer comprising a sample-specific sequence tag, where the first 
oligonucleotide primer comprises at least one degenerate nucleotide; 

10 (b) selectively amplifying at least a subset of the cDNA so as to generate one 

or more sample-specific amplified products; 

(c) detecting the abundance of one or more the sample-specific amplified 
products, where the abundance determines an expression profile of one or more genes in the first 
sample; and 

15 (d) comparing the expression profile of the one or more genes in the first 

sample with an expression profile of the one or more genes in a second sample, where a 
difference in the expression profile indicates differential expression of the one or more genes in 
the two sainples. 

The invention provides a method for comparing gene expression profiles of two or more 
20 samples, the method comprising: 

(a) synthesizing a plurality of first strand cDNAs firom a fiurst sample using a 
first oligonucleotide primer coiiq)rising a sample-specific sequence tag, where the sample- 
specific sequence tag comprises at least one artificial nucleotide which shows a preference of 
base pairing with anottier artificial nucleotide over a conventional niicleotide; 

25 (b) selectively amplifying at least a subset of the cDNA so as to generate one 

or more sample-specific amplified products; 
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(c) detecting the abundance of one or more the sample-specific amplitied 
products, where tiie abundance determines an expression profile of one or more genes in the first 
sample; and 

(d) comparing the expression profile of the one or more genes in the first 
5 sample with an expression profile of ttie one or more genes in a second sample, where a 

difference in the expression profile indicates differential expression of the one or more genes in 
the two samples. 

The invention further provides a method for comparing gene expression profiles of two 
or more samples, the method comprising: 

10 (a) synthesizing a plurality of first strand cDNAs firom a first sample using a 

first oligonucleotide primer comprising a sample-spepific sequence tag, where the sample- 
specific sequence tag is GC rich at its 5 ' terminal and At rich at its 3 ' terminal; 

(b) selectively synthesizing one or mor^ second strand cDNAs 
complementary to the first strand cDNAs using a second oligonucleotide primer comprising a 

15 first arbitrary sequence tag; 

(c) amplifying the one or more second strand cDNA so as to generate one or 
more sample-specific amplified products; 

(d) detecting the abxmdance of one or more the sample-specific amplified 

. products, where the abundance determines an expression profiile of one or more genes in flie first 
20 sample; and 

(e) comparing the expression profile of the one or more gmes in the first 

' ■ ■ . ■ ■ - . ■ , • ■ ' ' ' ■ . • • ■ 

sample with an expression profile of the one or more genes in a second sample, where a 

difference in the expression profile indicates differmtial expression of the one or more genes in 
the two samples, 

. • . . . • 

25 ^ ^ The invention still provides a method for comparing gene expression profiles of two or 
more samples, the method comprising: 
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(a) synthesizing a plurality of first strand cDNAs firom a first sample using a 

first oligonucleotide primer comprising a sample-specific sequence tag, where the first 
oligonucleotide primer comprises at least one degenerate nucleotide; 

(b) selectively synthesizing one or more second strand cDNAs 

5 complementary to the first strand cDNAs using a second oligonucleotide primer comprising a 
first arbitrary sequence tag; 

(c) amplifying the one or more second strand cDNA so as to generate one or 
more sample-specific amplified products; 

(d) detecting the abundance of one or more the sample-specific amplified 

10 products, where the abundance determines an expression profile of one or more genes in the first 
sample; and 

(e) comparing the expression profile of the one or more genes in the first 
sample with an expression profile of the one or more genes in a second sample, where a 
difference in the expression profile indicates differential expression of the one or more genes in 

15 the two samples. 

The invention finrther provides a method for comparing gene expression profiles of two 
or more samples, the method comprising: 

(a) synthesizing a plurality of first strand cDNAs firom a first sample using a 
first oligonucleotide primer comprising a sample-specific sequence tag, where the sample- 

20 specific sequence tag comprises at least one artificial nucleotide which shows a preference of 
base pairing with another artificial nucleotide over a conventipnal nucleotide; 

(b) selectively synthesizing one or more second strand cDNAs 
complementary to the first strand cDNAs using a second oligonucleotide primer comprising a 
first arbitrary sequence tag; 

25 (c) amplifying the one or more second strand cDNA so as to generate one or 

more sample-specific ampUfied products; 



7 



BNSDOCIO: <WO 03035841A2 I > 



wo 03/035841 PCT/US02/34056 

(d) detecting the abundance of one or more tlie sampie-specmc ampuxicu 

products, where the abundance determines an expression profile of one or more genes in the first 
sample; and 

(e) comparing the expression profile of the one or more genes in the first 
5 sample with an expression profile of the one or more genes in a second sample, where a 

difference in the expression profile indicates differential expression of the one or more genes in . 
the two samples. 

The invention provides a method of identifying a modulator which regulates one or more 
gene expression in a sample, the method comprising: 

10 (a) synthesizing a plurality of first strand cDNAs, before contacting the 

i" i-_ * . - . • " 

sample with tiie modulator, using a fiarst oligonucleotide primer comprising a sample-ispecific 
sequence tag, where the sample-specific sequence tag is GC rich at its 5' terminal and At rich at 
its 3' terminal; 

(b) selectively amplifying at least a subset of the cDNA so as to generate one 
15 or more sample-specific amplified products; 

(c) detecting the abundance of one or more the sample-specific amplified 
products, where the abundance determines an expression profile of one or more genes in the 
sample; and 

(d) comparing the expression profile of the one or more genes in the sample 
20 before contacting with the modulator with an expression profile of the one or more genes in the 

saniple afl:er contacting the modulator, where a difference in the expression profile indicates the 
modulator regulating one or more gene expression in the sample. 

The invention also provides a method of identifying a modulator which regulates one or 
more gene expression in a sample, the method comprising: 

25 (a) synthesizing a plurality of first strand cDNAs, before contacting the 

sample with ttie modulator, using a first oligonucleotide primer comprismg a sample-specific 
sequence tag, where the first oligonucleotide primer comprises at least one degenerate 
nucleotide; 
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(b) selectively amplifying at least a subset of the cDNA so as to generate one 

or more sample-specific amplified products; 

(c) detecting the abtmdance of one or more the sample-specific amplified 
products, where the abundance determines an expression profile of one or more genes in the 

5 sample; and 

(d) comparing the expression profile of the one or more genes in the sample . 
before contacting with the modulator wifli an expression profile of the one or more genes in fho 
sample after contacting the modulator, where a difference in the expression profile indicates the 
modulator regulating one or more gene expression in the sample. 

1 0 The invention fiirfher provides a method of identifying a modulator which regulates one 

or more gene expression in a sample, the method comprising: 

(a) synthesizing a pliurality of first strand cDNAs, before contacting the 
sample with the modulator, using a first oligonucleotide primer comprising a sample-specific 
sequence tag, where the sample-specific sequence tag is GC rich at its 5' terminal and At rich at 

15 its 3' terminal; 

(b) synthesizing one or more second strand cDNAs using a second 
oligonucleotide primer comprising a first arbitrary sequence tag; 

(c) amplifying the second strand cDNAs so as to generate one or more 
sample-specific amplified products; 

.20 (d) detecting the abundance of one or more the sample-specific amplified 

. products;, where the abundance determines an e^q^ression profile of one or more genes in the 
sample; and 

(e) comparing the expression profile of the one or more genes in the sample 
before contacting with the modulator with an expression profile of the one or more genes in the 

25 sample after contacting the modulator, where a difference in the expression profile indicates the 
modulator regulating one or more gene expression in the sample. 

The invention still provides a method of identifying a modulator which regulates one or 
more gene expression in a sample, the method comprising: 
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synmesizmg a plurality of iirst strand cDNAs, before contacting the 
sample with the modulator, using a first oligonucleotide primer comprising a sample-specific 
sequence tag, where the first oligonucleotide primer comprises at least one degenerate 
nucleotide; 

5 (b) synthesizing one or more second strand cDNAs using a second 

oligonucleotide primer comprising a first arbitrary sequence tag; 

(c) amplifying the second strand cDNAs so as to generate one or more 
sample-specific amplified products; 

(d) . detecting the abundance of one or more the sample-specific amplified 
10 products, where the abundance detennines an expression profile of one or more genes in the 

sample; and 

(e) comparing ttie expression profile of the one or more genes in the sample 
before contacting with the modulator with an expression profile of the one or more genes in the 
sample after contacting the modulator, where a difference in the expression profile indicates the 

15 modulator regulating one or more gene expression in the saiiaple. 

In a preferred embodiment, the step (a) of the subject method comprises reverse 
transcribing RNA from two or more sample sources into first strand cDNA, and where the 
cDNA is differentially tagged according to their sources. 

Preferably, the plurality of first strand cDNAs is synthesized by reverse transcription 

20 using total RNAs or mRNAs derived from tile first sample, 

- ■ ' • ^l'..- ' ' ■ ■ ' ■ \ ■ . . ' • 

Preferably, the second sequence in the second pfigonucleotide primer is gene-family^^ . 

specific. 

More preferably, the second sequence in the second oligonucleotide primer is a sequence 
encoding a peptide specific for a protein family. 

25 Still more preferably, the second sequence comprises a sequence encoding a signature 

sequence motif for a specific protein family. 

Preferably, the protein family is selected from the group consisting of: receptor tyrosine 
kinases, G protein coupled receptors, seven transmembrane receptors, ion channels/ cytokine . 

10 
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4WK,cptois, tumor mancers, mj^jfjs^ cascade kinases, transcriptional factors, GTPases, ATPases, 

and development protein markers. 

Preferably, a third oligonucleotide primer comprises the sequence-specijBc sequence tag 
of the first oligonucleotide primer is used for the amplifying so as to generate one or more 
5 sample-specific amplified products. 

Also preferably, at least one of the two.or more samples is derived form the group 
consisting of: a normal sample, a disease sample, a sample at a given development stage or 
condition, a sample prior to a given treatment stage or condition, a sample after a given treatment 
stage or condition, and a sample at a given culturing stage or condition. 

10 Still preferably, at leiast one of the two ormore samples is derived firom the group 

consisting of: an aiiimal, an organ, a tissue type, and a cell type. 

In one embodiment, at least one sample is derived firom a normal individual and at least 
another sample is derived firom a disease individual. 

In anotiier embodiment, at least one sample is derived from a development stage of an 
15 individual and at least another sample is derived from a different development stage of the same 
individual. 

In yet another embodimeiit, at least one sample is derived &om a disease stage of an 
individual and at least another sample is derived from a different disease stage of the same 
individual. 

20 In still another embodiment, at least one sample is derived from a stage of a disease 

treatmeM of an individual and at least another s^^^^ 
same disease treatment of the same individual. 

In another embodiment, at least one sample is derived from an individual who was 
exposed to an environmental factor and at least another sample is derived from an individual 
25 who was not exposed to the same environmental factor or who was exposed to the environmental 
factor at a different concentration. 

In one embodiment, the one or more second strand cDNAs are ampUfied by PGR so as to 
generate one or more amplified PGR products. 

♦ 
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X iciciauiy, uic one or more ampunea products are sampled at a predetermined time or 

cycle interval during the amplification. 

In one embodiment, the one or more amplified products are sampled after each cycle of 
the amplification. 

5 Jn another embodiment, the one or more amplified products are sampled after one or 

more predetermined cycles, for example, after cycle 2, 5, 10, 25, 30, or cycle 45. 

In one embodiment, the one or more amplified products are sampled by withdrawing 1% 
to 40% (v/v) of thiB reaction mixture, preferably, by withdrawing 1% to 30% (v/v) of the reaction 
mixture. 

10 In another embodiment, the reaction mixture is replenished after each sampling with 

equivalent volume of a mixture comprising dNTPs, primers, necessary reagents, and a DNA 
polymerase at the same cpncmtration as the starting reaction mixture. 

Preferably, the abundance is detected for each sampled amplified product. 

Preferably, the subject method fiurfher comprises separating the one or more amplified 
1 5 products before detecting the abundance of the one or more amplified products. 

In one embodiment, the one or more amplified products are separated and their 
abundance detected by chromatography. 

In another embodiment, the one or more amplified products are separated and their 
abundance detected by mass spectrometry. 

20 : In yet another embodiment, the one or more amplified products aire separated and their 
abimdauce detected by electrophoresis. 

Preferably, &e one or more amplified products are separated and their abundance 
detected by c^illary electrophoresis. 

In one embodiment, the sample-specific sequence in the first oligonucleotide primer is 
25 1 5-30 nucleotides in length, more preferably, 20-24 nucleotides in length, 

12 
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In a preferred embodiment, the first oligonucleotide primer further comprises a sequence 

of 5* oligo(dT)nVN 3 where n is at least 5; V is dATP, dGTP, or dCTP; and N is dTTP (or 
dUTP), dATP, dGTP, or dCTP. 

Preferably, n is 12-16 in 5' oUgo(dT)nVN 3'. 

5 Also preferably, in the first oligonucleotide primer, the sample-specific sequence tag is . 

located at the 5' of oligo(dT)nVN. 

Preferably, the second oligonucleotide primer of the subject method fiirther comprises a 
second sequence which is complementary to a subset of the first strand cDNAs so as to permit 
the synthesis of one or more second strand cDNAs. 

10 Moie preferably, in the second oligonucleotide primer, the second sequence is located 3* 

of the first arbitrary sequence. 

Also inore preferably, the second oligonucleotide fiirther comprises a sequence of (Z)m 
between the first and second sequences, where Z is a nucleotide which can form base pair with 
any of A, T, G, or C, and m is at least 2. Preferably, m is 4. 

15 hi one embodiment, the second sequence in the second oligonucleotide primer is 5- 10 

nucleotides in length. 

In another embodiment, the second sequence in the second oligonucleotide primer is 6-7 
nucleotides in length. 

Preferably, the second sequence in the second oligonucleotide primer is a palindromic 
20 sequence, 

In one embodiment, the first arbitrary sequence in the second bUgonucieotide prinier is 
15-30 nucleotides in length, preferably 20 nucleotides in length. 

In another embodiment, the first arbitrary sequence in the second oUgonucleotide primer 
comprises an A-T rich region and a G-C rich region. 

25 Preferably, the G-C rich region is located at 5' of the A-T rich region. 

Preferably, the second oligonucleotide primer used is the same for the two or more 

samples to be compared. * ^ 

13 
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In a preferred embodiment, the amplifying step of the subject method tiirttier compnses 
using a fourth oUgonucleotide primer which comprises the first arbitrary sequence tag of the 
second oligonucleotide primer. 

Preferably, the fourth oUgonucleotide primer used is the same for the two or more 
5 samples to be compared. 

X 

In one embodiment, the first strand cDNA is synthesized in a solution without attaching 
to a solid support. 

In another embodiment, the first strand cDNA is synthesized attaching to a solid support. 

Preferably, the solid support is a microparticle or an inner wall of a reaction tube. 

10 . / In a preferred embodiment, the subject method of the invention finther comprises 

separating the one or more second strand cDNA fcom the plurahty of first strand cDNA before 
ampUfying the one or more second strand cDNAs. 

In one anbodiment, the third oligonucleotide primer used in the subject method is linked 
to a detectable label. 

15 Preferably, the detectable is selected firom a group consisting of: fluorescent labels, 

radioactive labels, colorimetrical labels, magnetic labels, and enzymatic labels. 

More preferably, the detectable label is a fluorescent label. 

In a preferred embodiment, the third oligonucleotide primer used for each of the two or 
ttore samples is labeled with a sample-specific label. 

20 ;^ In one embodiment according to the subject method of the invention, the difference in the 
expression profile of the one or more genes is measured by a ratio of sample-specific detectable 
iabeis on amplified products firom the genes between two or more samples. 

Preferably, the method further comprises generating an amplification plot (signal 
intensity as a function of amplification cycle number), calculating a threshold cycle number (Q) 
25 of ainplification for each of the one or more genes based on the signal intensity of each PGR 
firagment. Operational differential expression of particular gene is determined as a difference in 
threshold cycle number (Ct) for this gene m two (or more) samples more than one cycle in value. 

• J. 
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The threshold cycle number is further used to derive copy number for each gene and to measure 

the difference in the expression by a ratio of copy numbers for gene in two or more samples. 

The method also comprises generating an plot of the rate of signal intensity change as a 
function of number of amplification cycles [derivative of Signal Intensity as a function of cycle 
5 niuner, d(Signal Ihtensity)/d(cycle number)] for each amplified gene. The alternative threshold 
cycle (aCt), determined as a cycle number corresponding to the maximal value of d(Signal 
Intensity)/d(cycle ntunber) for each amplified gene firom one sample, is compared to the aCt for 
the same gene firom another sample. Difference in one cycle between aCt values for the same 
gene in two or more samples is defined as alternative operational differential expression. 

10 Also preferably, the method further comprises collecting PGR fragment or PGR 

firagments correspopding to one or more genes which display operational differential expression 
or alternative operational differential expression, and identifying the sequence of the one or more 
genes. 

In one embodiment, the sequence identities of the one or more genes which are 
1 5 dififer^tially expressed are identified by DNA sequencing. 

In one embodiment, the subject method may further comprise a second amplification 
reaction using the one or more amplified products from the first amplification to generate one or 
more secondly amplified products and detecting the abundance of the one or more secondly 
ampUfied products. 

20 Preferably, the amplifying step of the subject method is performed by PGR. 

The subject method of the invention may further comprise a nested PGR reaction as a 
second amplification reaction. 

The present invention provides a composition for detecting the level of gene expression^ 
comprising a first oUgonucleotLde primer, where the first oligonucleotide primer comprises a 
25 sample-specific sequence tag and where the first oligonucleotide primer comprises at least one 
degenerate nucleotide. 

In one embodiment, the first oligonucleotide primer is provided as a mixture of primers 
comprising [5'-(specific sequence tag)2o.24Ti2-i6AN-3 5 '-(specific sequence tag)2o-24Ti2-i6GN- 
3', and 5'-(specific sequence tag)2o-24Ti2-i6GN-3']. 
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The present invention also provides a composition for detecting the level of gene 

expression, comprising a first oligonucleotide primer, where the first oligonucleotide primer 

comprises a sample-specific sequence tag and where the sample-specific sequence tag is GC rich 

at its 5' terminal and AT rich at its 3' terminal. 

5 Preferably, the subject composition fiirther comprises a second oUgonucleotide primer. 

More preferably, the second oligonucleotide primer comprises a first arbitrary sequence - 

tag. 

Preferably, the second primer fiulher comprises a second sequence which is 
complementary to a sequence of the first strand cDNA. 

1 0 > The subj ect composition may further comprise a third oligonucleotide primer comprising 
the sequence-specific sequence tag of the first oligonucleotide primer. 

The subject composition may further comprise a fourth oligonucleotide primer which 
comprises the first arbitrary sequence tag. 

The subject composition may fiirther comprise one or more components selected fironi 
15 the group of: a reverse transcriptase, a DNA polymerase, a reaction buffer for the reverse 
transcriptase, a reaction buffer for the DNA polymerase, and dNTPs. 

The invention provides a kit for detecting the level of gene expression, comprising a first 
oUgonucleotide primer, wherein the first oligonucleotide primer comprises a sample-specific 
sequence tag and wherein the first oligonucleotide primer comprises at least one degenerate 
20 nucleotide, and packaging material thereof. 

The invention also provides a kit for detecting the level of gene expression, comprising a 
first oligonucleotide primer, wherein the first oligonucleotide primer comprises a sample-specific 
sequence tag and wherein the sample-specific sequence tag is GC rich at its 5' terminal and AT 
. rich at its 3 ' terminal, and packaging material thereof. 

25 The kit of the present invention may also comprise a second oUgonucleotide primer. 

Preferably, the second oUgonucleotide primer comprises a first arbitrary sequence tag. 
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The kit of the present invention may further comprise a third oligonucleotide primer 

comprising the sequence-specific sequence tag of the first oligonucleotide primer. 

The kit of the present invention may still further comprise a fourth oligonucleotide primer 
which comprises the first arbitrary sequence tag. 

5 Preferably, the second primer further comprises a second sequence which is 

complementary to a sequence of the first strand cDNA- 

Also preferably, the kit of the present invention further comprises one or more 
components selected from the group of: a reverse transcriptase, a DNA polymerase, a reaction 
buffer for said reverse transcriptase, a reaction buffer for said DNA polymerase, and dNTPs. 

10 BRIEF DESCRIPTION OF DRAWINGS 

Figure 1 is a diagram showing tiae reverse transcription of mRNAs from two samples 
using oligo-dT primers with sample-specific sequence tags according to one embodiment of ttie 
invention. The resulting.cDNAs from each sample are labeled by sample-specific tags. 

Figure 2 is a diagram showing the second strand cDNA synthesis of selected genes using 
15 a primer comprising a gene-family-specific sequence according to one embodimrat of the 
invention. 

Figure 3 is a diagram showing the PGR amplification to generate amplified products with 
sample-specific tags according to one raibodiment of the invention. 

Figure 4 is a diagram showing the separation and analysis of PGR products according , to 
20 one embodiment of the invention. 

Figure 5 is a graph showing typical curves of PGR product accmnulation according to 
one embodiment of the invention. It is apparent that the range of cycles where differences , 
between different samples are most easily detected is narrow, a) The quantitative ineasure of 
gene expression (Ct) is defined as a cycle niunber corresponding to the point at which the signal 
25 intensity exceeds the chosen threshold liiiut (usual set as i 0 fold the standard deviation of the 
baseline). The operational differential expression (ACt) is defined as difference m;Qt values for 
two PGR fragments, b) Alternative determination of threshold cycle based on plotting of 
d(Signal intensity)/d(cycle nmnber) as a function of cycle nimiber. The alternative determination 
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of threshold cycle (aCt) is defined as a cycle number which corresponds to the maximal value ot 

d(Si^al intensity)/d(cycle number). Similar to threshold number, aCt can be used to determine 

absolute copy number for each gene (log(copy number)=AaCt +B). The alternative operational 

differential expression (AaCt) is defined as difiference in aCt values for two PGR Segments. 

.5 Figure 6 is a diagram showing the nomialized PGR Amplification scheme according to 

one embodiment of the invention. 

Figure 7 is a diagram showing the method of transcriptional profiling according to a 
preferred embodiment of the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

10 _ Definitions : 

As used herein, the term "sample" refers to a biological material which is isolated fi-om 
its natural environment and containing a polynucleotide. A "sample" according to the invention 
may consist of purified or isolated polynucleotide, or it may comprise a biological sample such 
as a tissue sample, a biological fluid sample, or a cell sample comprising a polynucleotide, A 
15 biological fluid includes blood, plasma, sputum, urine, cerebrospinal fluid, lavages, and 

leukophoresis samples. A sample of the present invention may be any pliant, animal, babterial or 
viral material containing a polynucleotide. 

As defined herein, a "tissue" is an aggregate of cells that perform a particular function in 
an organism. The term "tissue" as used herein refers to cellular material firom a particular 

20 physiological region. The cells in a particular tissue may comprise several different cell types. 
A non-limiting example of this would be brain tissue that fiuther comprises neurons and glial /; 
^)^^ ^^^^ as capillary endothelial cells and blood cells. The term ^tissue" also is intended to 
encompass a plurality of cells contained in a sublocation on the tissue microarray that inaly 
normally exist as independent or non-adherent cells in the organism, for example immune cells, 

25 or blood cells. The term is further intended to encompass cell lines and other sources of cellular 
i^iaterial that now exist which represent specific tissue types (e.g., by virtue of expression of 
biomolecules characteristic of specific tissue types). 
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As used herein, "plurality" refers to more than two. Plurality, according to the invention, 
can be 3 or more, 100 or more, or 1000 or more, for example, up to the number of cDNAs 
corresponding to all mRNAs in a sample. 

As used herein "different types of tissues" refers to tissues which are preferably from 
5 different organs or which are at least from anatomically and histologically distinct sites m the 
same organ. 

As vised herein "a cell sample" is distinguished from a tissue sample in that it comprises a 
cell or cells which are disassociated from other cells. 

As defined herein, "an individual" is a single organism and includes humans, animals, 
10 plants, multicellular and unicellular organisms. 

As used herein, a "saniple-specific sequence" refers to a polynucleotide sequence which 
is used to identify a polynucleotide molecule derived from a specific sample source. A "sample- 
specific sequence" of the present invention indicates the sample source of an isolated or 
synthesized polynucleotide and distinguishes an isolated or synthesized polynucleotide of one 

15 sample from that of another sample. Therefore, a sample-specific sequence has a unique 

characteristic which can be identified The unique characteristic of a sample-specific sequence 
may be a specific sequence identity or a specific sequence length. If a specific sequence identity 
is used, one sample-specific sequence should be different from another sample-specific sequence 
in at least one nucleotide, for example, in at least 2, or 3, or 4, or 5, or 10, or 15, or 20, or more, 

20 up to 60 nucleotides. If a specific sequence length is used, one sample-specific sequence should 
be different in length from another sample-specific sequence in at least one nucleotide, for 
example, in at least 2, or 3, or.4, or 5, or 10, or 15, or 20, or more, up to 50 nucleotides. 

As used herein, a "polynucleotide molecule derived from a specific sample" 
polynucleotide isolated from the specific sample, or it may be a polynucleotide synthesized from 
25 the specific sample, e.g., through the technologies of reverse transcription or polymerase chain 
reaction (PGR), ligase chain reaction (LCR), and polynucleotide-specific based amplification 
(NSBA), strand displacement amplification (SDA) and any other technologies known in the art. 

As used herein, the term "different samples" refers to two or more samples which are ti 
be compared according to tiie subject methods of the invention, whether or not they contain 

30 identical tissue or samples from different sources. Different sources can be, but are not limited 

-ji. 
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to, a disease source and a normal source; different cell types, different tissue or organ types; 

different individuals; samples subjected to different environmental exposures; different 

development stages; different stages of a disease; and different stages of treatment. 

As used herein, the term "amplified product" refers to polynucleotides which are copies 
5 of a portion of a particular polynucleotide sequence and/or its complemratary sequence, which 
correspond in nucleotide sequence to the template polynucleotide sequence and its 
complementary sequence. An "amplified product," according to the invention, may be DNA or 

''it' 

KNA^ and it may be double-stranded or single-stranded. 

As used herein, the terms "synthesis" and "amplification" are used interchangeably to 

ii. 

10 refer to a reaction for generating a copy of a particular polynucleotide sequence or increasing in 
copy number or amount of a particular polynucleotide sequence. It may be accomplished, 
without limitation, by the in vitro methods of polymerase chain reaction (PGR), hgase chain 
reaction (LCR), polynucleotide-specific based amplification (NSB A), or any other method 
known in the art. For example, polynucleotide amplification may be a process using a 

15 polymerase and a pair of oligonucleotide primers for producing any particular polynucleotide 

sequence, i.e,, the target polynucleotide sequence or target polynucleotide, in an amount wluch is 
greater than that initially present. 

As used herein, the term "selectively," when referred to the amplification or synthesis of 
polynucleotide, refers to amplifying or synthesizing a selected group of polynucleotides 
20 comprising a complementary sequence. The selection is achieved by using a specific 

oligonucleotide primer in an amplification or synthesis reaction. For example, a group of second 
jtrand cDNAs may be selectively synthesized by usinjg a second oligonucleotide comprising a 
; Sequence (e.g., the second sequence as described herein a,fler) which is complementary to a gene 
4fainily specific sequeiice. 

25 As used herein, the term "at least a subset" refers to the amplification or synthesis of 

j^either all polynucleotides in a reaction or less than all polynucleotide templates m an 
simplification or synthesis reaction. For example, a subset of polynucleotides (e.g., first strand 
' cDNAs) may be amplified or synthesized by the use of a specific oligonucleotide primer which 
selectively amplifies or synthesizes a group (e.g., a gene family) of polynucleotides firom the 
30 population of all first strand cDNAs. 
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As \ised herein, a "target polynucleotide" is a polynucleotide sequence whose level of 
expression is to be analyzed. A target polynucleotide may be isolated or amplified before its 
expression level is analyzed. For example, a target polynucleotide may be a sequence that lies 
between the hybridization regions of two members of a pair of oligonucleotide primers which are 

5 used to amplify it. A target polynucleotide may be RNA or DNA, for example, it may be mRNA 
or cDNA, a coding region of a gene or a portion thereof A target polynucleotide sequence 
generally exists as part of a larger "template" sequence; however, in some cases, a target 
sequence and the template are the same. Although "template sequence" generally refers to the 
polynucleotide sequence initially present, the products firom an amplification reaction may also 

10 be used as template sequence in subsequent amplification reactions. A "target polynucleotide" 
or a ^template sequence" may be a nomial (e.g., wild type) or a mutant polynucleotide that is or 
includes a particular sequence. 

As used herein, the term "RT-PCR" refers to coupled reverse transcription and 
polymerase chain reaction. This method of amplification uses an initial step in which a specific 
15 oligonucleotide, oligo dT, or a mixture of random primers is used to prime reverse transcription 
of RNA into a first single-stranded cDNA; this cDNA is then amplified using standard 
ampUfication techniques, e.g. PGR, so as to generate a second complementary strand and double- 
stranded cDNA. 

As used herein, an "oligonucleotide primer" refers to a polynucleotide molecule (i.e., 
20 DNA or RNA) capable of annealing to a polynucleotide template and providing a 3 ' end to 
produce an extension product which is complementary to the polynucleotide template. The 
conditions for initiation and extension usually include the presence of four differrat 
deoxyribpnucleoside triphosphates and a polymerization-induciag agent such as DNA 
polymerase or reverse transcriptase, in a suitable buffer ("buffer" includes substituents which are 
25 cofactors, or which affect pH, ionic strength, etc.) and at a suitable temperature. The primer 

according to the invention may be single- or double-stranded. The primer is single-stranded for 
maximum efficiency in amplification, and the primer and its complement form a double-stranded 
polynucleotide. But it may be double-stranded. "Primers" usefiil in the present invention are 
less than or equal to 100 nucleotides in length, e.g;, less than or equal to 90, or 80, or 70, or 60, 
30 or 50, or 40, or 30, or 20, or 15, or equal to 10 nucleotides in length. 

As used herein, the term "arbitrary sequence" is defined as being based upon or subject to 
individual judgement or discretion. In some instances, the arbitrary sequence can be entirely 
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random or partly random for one or more bases. In other instances the arbitrary sequence can be 
selected to contain a specific ratio of each deoxynucleotide. for example approximately equal 
proportions of each deoxynucleotide or predominantly one deoxynucleotide, or to not contain a 
specific deoxynucleotide. The arbitrary sequence can be selected to contain, or not to contain, a 
5 recognition site for specific restriction endonuclease. The arbitrary sequence can be selected to 
either contain a sequence that is complementary to an mRNA or a cDNA of known sequence or 
to not contain sequence from an mRNA or cDNA of known sequence. 

As used herein, "GC rich" refers to a continuous stretch of nucleotides (or 3' terminal 
nucleotide) which has a GC content of at least 60% GC (e.g.. 3 bases of either G or C in a 5 base 

10 long stretch, 4 bases of either G or C in a 6 base long stretch, 5 bases of either G or C in a 7-8 
base long stretch, 6 bases of either G or C in a 9-10 base long stretch, 7 bases of eith^ G or C in 
a 11 base long stiretch, 8 bases of either G or C in a 12-13 base long stretch, or 9 bases of either 
G or C in a 14-15 base long stretch, 10 bases of either G or C in a 16 base long stretch, 11 bases 
of either G or C in a 17-18 base long stretch, 12 bases of either G or C in a 19-20 base long 

15 stretch, 13 bases of either G or C in a 21 base long stretch, 14 bases of either G or C in a 22-23 
• base long stretch, 15 bases of either G or C in a 24 base long stiretch, 16 bases of either G or C in 
a 25-26 base long stretch), or preferably at least 70% GC, or at least 80% GC or at least 90% GC 
or up to 100% GC. 

As used herein, "AT rich" refers to a continuous sti«tch of nucleotides (i.e., including the 
20 5' or 3' terminal nucleotide) which has a AT content of at least 60% AT (e.g., 3 bases of either A 
or T in a 5 base long stiretch, 4 bases of either A or T in a 6 base long stiretch, 5 bases of either A 
or T in a 7-8 base long sti-etch, 6 bases of either A or T in a 9-10 base long stietch, 7 bases of 
' either A or T in a 11 base long stiretch, 8 bases of either A or T in a 12-13 base long stiretch, or S> 
^liases of either A or T in a 14-1 5 base long stiretch, 10 bases of either A or T in a 16 base long 
25 "stietch, 11 bases of eitiier A or Tin a 17-1 8 base long stretch, 12 bases of eitiier A or Tin a 19- 

• 20 base long stretch, 13 bases of either A or T in a 21 base long stietch, 14 bases of eitiier A or T 
^ ina 22-23 base long stiretch, 15 bases of either A or T in a 24 base long stretch, 16 bases of either 
-A or T in a 25-26 base long stiretch), or preferably at least 70% AT, or at least 80% AT or at least 
'^ 90% AT or up to 100% AT. 

30 As used herein, the term "gene family specific" sequence refers to a sequence of 

nucleotides on an oligonucleotide primer which anneals to more than, one polynucleotide 
template in an amplification reaction. A "gene-famUy specific" primer is not required to be 
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completely complementary to a template. Generally, a primer comprising a gene family specitic 
sequence will anneal to at least 2, or 5, or at least 20, usually at least 50 and more, or usually at 
least 75 distinct genes as represented by distinct mKNAs or cDNAs in the sample. The term 
"distinct", when used to describe genes, refers any two genes are considered distinct if they 
5 comprise a stretch of at least 100 nts in their RNA coding regions in which the sequence 

similarity does not exceed 98%, as determined by FASTA (default settings). A "gene-family- 
specific sequence" is at least 4 nucleotides or more in length, e.g., at least 5, 6, 7, 8, 9, 10 or 
more and up to 50 nucleotides in length. 

As used herein, "label" or "detectable label" refers to any atom or molecule which can be 
10 used to provide a detectable (preferably quantifiable) signal, and which can be operatively linked 
to a polynucleotide. Labels may provide signals detectable by fluorescence, radioactiyity, 
colorimetry, gravimetry. X-ray diffiaction or absorption, magnetism, enzymatic activity, mass 
spectrometry, binding affinity, hybridization radiofirequency, nanocrystals and the like. A primer 
of the present invention may be labeled so that the ampUfication reaction product may be 
15 "detected" by "detecting" the detectable label. "Qualitative or quantitative" detection refers to 
visual or automated assessments based upon the magnitude (strength) or number of signals 
generated by the label. A labeled polynucleotide (e.g., an oligonucleotide primer) according to 
the methods of the invention is labeled at the 5' end, the 3' end, or both ends, or intemaliy. Hie 
label can be "direct", e.g., a dye, or "indirect", e.g., biotin, digoxin, alkaline phosphatase (AP), 
20 horse radish peroxidase (HRP). For detection of "indirect labels" it is necessary to add 
additional components such as labeled antibodies, or enzyme substrates to visualize the, 
c^tured, released, labeled polynucleotide fi:agment. In a preferred embodiment, an 
oligonucleotide primer is labeled with a fluorescent labeL Suitable fluorescent labels include 
fluorochrornes such as rhodaxriine and derivatives (such as Texas Red), fluorescem 
25 derivatives (such as 5-bromomethyl fluorescein), Lucifer Yellow, lAEDANS, 7-Me2N- 
coumarin4-acetate, 7-OH-^-CH3-c6umarin-3-acetate, 7-NH2-4-CH3-coumarin-3-acetate 
(AMCA), monobromobimane, pyrene trisulfonates, such as Cascade Blue, and 
monobromorimethyl-ammoniobimane (see for example, DeLuca, Immunofluorescence Analysis ^ 
m Antibody As a Tool, M^chalonis, et al, eds., John Wiley & Sons, Ltd., (1982), which is 
30 incorporated herein by reference). 

The term "Imked" means covalently and non-covalently bonded, e.g., by hydrogen, lomc, 
or Van-der-Waals bonds. Such bonds may be formed between at least two of the same or 
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different atoms or ions as a result of redistribution of electron densities of those atoms or ions. A 
polynucleotide of the invention (e.g., an oligonucleotide primer) can be linked to a detectable 
label and/or a solid support. 

As used herein, the term "opposite orientation", when referring to primers, means that 
one primer comprises a nucleotide sequence complementary to the sense strand of a target 
polynucleotide template, and another primer comprises a nucleotide sequence complementary to. 
the antisense strand of the same target polynucleotide template. Primers with an opposite 
orientation may generate a PC31 amplified product from matched polynucleotide template to 
which they complement Two primers with opposite orientation may be referred to as a reverse 
primer and a forward primer. 

As used herein, the term "same orientation", means that primers comprise nucleotide 
sequences complemeritaiy to the same strand of a target polynucleotide template. Primers with 
same orientation will not generate a PGR amplified product fix>m matched polynucleotide 
template to which they complement 

As used herein, a "polynucleotide" generally refers to any polyribonucleotide or poly- 
deoxyribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA. 
"Polynucleotides" include, without limitation, single- and double-stranded polynucleotides. As 
used herein, the.term "polynucleotide(s)" also includes DNAs or RNAs as described above, that 
contain one or more modified bases. Thus, DNAs or RNAs with backbones modified for 
stability or for olher reasons are "polynucleotides". The term "polynucleotides" as it is used 
herein embraces such chemically, enzymatically or metabolically modified forms of 
polynucleotides, as well as the chemical forms of DNA and RNA characteristic of viruses and 
i^Us, inchiding for example, simple and complex cells. A polynucleotide usefiil for tiie present 
^Vention iiiay be an isolated or purified polynucleotide or it may be an ampUfied polynucleotide 
ih an amplification reaction. 

^ ... . . As used herein, "isolated" or "purified" when used in reference to a polynucleotide 
means that a naturally occurring sequence has been rranoved from its normal cellular (e.g., 
•chromosomal) environmrait or is synthesized in a non-natural environment (e.g., artificially 
synthesized). Thus, an "isolated" or "purified" sequence may be in a ceU-free solution or placed 
in a (Afferent ceUiiar environment The term "purified" does not imply that the sequence is tiie 
only nucleotide present, but that it is essentially free (about 90-95%, up to 99-100% pure) of 

24 



. <WO 03035e*1A2J_> 



wo 03/035841 PCTAIS02/34056 
non-nucleotide or polynucleotide material naturally associated with it, and thus is distinguished 

firom isolated chromosomes. 

As used herein, the term "cDNA'* refers to complementary or copy polynucleotide 
produced from an RNA template by the action of RNA-dependent DNA polymerase (e.g., 
5 reverse transcriptase). A "cDNA clone" refers to a duplex DNA sequence complementary to an 
RNA molecule of interest, carried in a cloning vector. 

As used herein, "genomic DNA" refers to chromosomal DNA, as opposed to 
complementary DNA copied from an RNA transcript "Genomic DNA", as used herein, may be 
all of the DNA present in a single cell, or may be a portion of the DNA in a single cell. 

10 As used herein, "complementary" refers to the ability of a single strand of a 

polynucleotide (or portion thereof) to hybridize to an anti-parallel polynucleotide strand (or 
portion thereof) by contiguous base-pairing between the nucleotides (that is not interrupted by 
any unpaired nucleotides) of the anti-parallel polynucleotide smgle strands, thereby forming a 
double-stranded polynucleotide between the complementary strands. A first polynucleotide is 

15 said to be "completely complementary" to a second polynucleotide strand if each and every 
nucleotide of the first polynucleotide fonns base-paring with nucleotides within the 
complementary region of the second polynucleotide. A first polynucleotide is not completely 
complementary (i.e., partially complementary) to the second polynucleotide if one nucleotide in 
the first polynucleotide does not base pair with the corresponding nucleotide in the second 

20 polynucleotide. The degree of complementarity between polynucleotide strands has significant 
effects on the efficiency and strength of annealing or hybridization between polynucleotide 
strands. This is of particular importance in amplification reactions, which depend upon binding 
: between polynucleotide strands. 

The terai "expression" refers to the production of a protein or nucleotide sequence in a 
25 cell or in a cell-free system, and includes transcription into an RNA product, post-transcriptional 
modification and/or translation into a protein product or polypeptide from a DNA encoding that 
product, as well as possible post-translational modifications. 

As used herein, the term "comparing the gene expression profile" refers to comparing the 
deferential expression of one or more polynucleotides in two or more samples. 
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As used herein, the tenn "expression proffle" refers to quantitative (i.e., abundance) and 
qualitative expression of one or more genes in a sample. 

As used herein, the term "difference in the expression profile" refers to the quantitative 
(i.e., abundance) and qualitative difference in expression of a gene. There is a "difference in the 

5 expression profile" if a gene expression is detectable in one sample, but not in another sample, 
by known methods for polynucleotide detection (e.g., electrophoresis). Alternatively, a 
"difference in the expression profile" exists if the quantitative difference of a gene expression. 
(i.e., increase or decrease) betwerai two samples is about 20%, about 30%, about 50%, about 
70%, about 90% to about 100% (about two-fold) or more, up to and including about 1.2 fold, 2.5 

10 fold, 5-fold, 10-fold, 20-fold, 50-fold or more. A gene with a diffarence in the expression profile 
between two samples is a gene which is differentially expressed in the two samples. 

As used herein, the term "differential expression" refers to both quantitative, as well as 
quaUtative, differences in a polynucleotide (e.g., a gene)'s temporal and/or cellular expression 
patterns among two or more samples, i.e., a difference in expression profiles. A polynucleotide 

15 is said to be "differentially expressed" if its expression is detectable in one sample, but not in 
anotiier sample, by known methods for polynucleotide detection (e.g., electrophoresis). A 
polynucleotide is also said to be "differentially expressed" if the quantitative difference of its 
expression (i.e., increase or decrease) between two samples is about 20%, about 30%, about 
50%, about 70%, about 90% to about 100% (about two-fold) or more, up to and including about 

20 1.2 fold, 2.5 fold, 5-fold, 10-fold, 20-fold, 50-fold or more. A "differentially expressed" gene 
transcript means a mRNA transcript that is found in different numbers of copies between two or 
more samples, e.g., in activated versus inactivated states, in different cell or tissue types of an 
individual at one development stage versus another development stage, m different cell or tissue 
types of an individual having a selected disease compared to the numbers of copies or state of tiie 

.25 gene transcript found in the same cells or tissues of a healthy organism. Since the number of 
mRNA transcript copies is proportional to the threshold cycle (a) the later can also be used for 
giMntitative estimation of differential expression. Therefore the gene can be considered as 
differentially expressed if the difference in Ct value for gene in two different samples is moxo 
than a cycle. 

30 As used herein, the term "abundance" refers to the amount (e.g., measured. in,^g, jxmol or 

copy number) of a target polynucleotide in a sample. The "abimdance" of a polynucleotide may 
be measured by methods well known m the art (e.g., by UV absorption, by comparing band 
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intensity on a gel with a reference of known length and amount), for example, as described in 

Basic Methods in Molecular Biology. (1986, Davis et al., Elsevier, NY); and Current Protocols 

in Molecular Biologv (1997, Ausubel et al, John Weley & Sons, Inc.). One way of measuring 

the abimdance of a polynucleotide in the present invention is to measure the fluorescence 

intensity emitted by such polynucleotide, and compare it with the fluorescence intensity emitted 

by a reference polynucleotide, i.e., a polynucleotide with a known amount. 

A "polynucleotide having a nucleotide sequence encoding a gene" means a 
polynucleotide sequence comprising the coding region of a gene, z.e, the polynucleotide 
sequence which encodes a gene product. The coding region may be present in either a cDNA, 
genomic DNA or RNA form. When present in a DNA form, the oligonucleotide may be single- 
stranded the sense strand) or double-straaded. Suitable control elements such as 
enhancers/promoters, splice junctions, polyadenylation signals, etc. may be placed in close 
proximity to the coding region of the gene if needed to pemiit proper initiation of transcription 
and/or correct processing of the primary RNA transcript. Altematively, the coding region 
utilized in the vectors of flie present invention may contain endogenous enhancers/promoters, 
splice junctions, intervening sequences, polyadenylation signals, etc. or a combination of both 
endogenous and exogenous control elements. 

As used herein, the term "degenerate nucleotide" denotes a nucleotide which may be any 
of dA, dG, dC, and dT; or may be able to base-pair with at least two bases of dA, dG, dC, and 
dT. An unlimiting Ust of degenerate nucleotide which base-pairs with at least two bases of dA, 
dG, dC, and dT include: Inosine, 5-nitropyrole, 5-nitroindole, hypoxanthine, 6H,8H,4- 
dihydropyrimido[4,5c][l,2]oxacin-7-one (P), 2-amino-6-methoxyaminopurine, dPTP and 8-oxo- 
dGTP. 

As used herein, the term "artificial nucleotide" refers to a nucleotide which is not a 
naturally occurring nucleotide. The term "naturally occurring" refers to a nucleotide that exists 
in nature without human intervention. In contradistinction, the term "artificial nucleotide" refers 
to a nucleotide that exists only with human intervention. A particularly important artificial 
nucleotide is one which shows a preference of base pairing with another artificial nucleotide over 
a conventional nucleotide (i.e., dA, dT, dG, dC and dU) (e.g., as described in OhtsuM et al. 2001, 
Proc. Natl. Acad. Sci,, 98:4922-4925, h^eby mcoiporated by reference). An artificial nucleotide 
is said to "show a preference of base pairing with another artificial nucleotide over a 
conventional nucleotide" when it shows 30% or more base paring ability with an artificial 
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nucleotide as compared to any of the conventional nucleotide. The base pairing ability may be 
measured by the T7 transcription assay as described in Ohtsuki et al. (supra). Other unlimiting 
examples of "artificial nucleotides" may be found in Lutz et al. (1998) Bioorg. Med. Chem. 
Lett., 8:1 149-1 152); Voegel and Benner, (1996) Helv. Chim. Acta 76, 1863-1880; Horlacher et 
5 al. (1995) Proc. Nati. Acad. Sci., 92:6329-6333; Switzer et al. (1993), Biochemistry 32:10489- 
10496; Tor and Dervan (1993) J. Am. Chem. Soc. 115:4461-4467; Piccirilli et al., (1991) 
Biochemistry 30, 10350-10356; Switzer et al. (1989) J. Am. Chem. Soc. 111:8322-8323, all of . . 
>vhich hereby incorporated by references. An "artificial nucleotide" may also be a degenerate 
nucleotide as defined hereinabove. 

10 f As used herein, a "signature sequence motif' refers to an amino acid sequence which 

remain highly conserved among members of a protein family or even among diverse families of 
proteins. These conserved residues, called "sequence motifs" or "signature sequenpes", can 
determine both protein function and structure. They are commonly used in identifying proteins 
or important protein regions such as active sites and binding sites. Sequence motifs are well 

1 5 known for many protein families, hi addition, a potential sequence motif may be foxmd by 
comparing related protein sequences using available computer programs. 

As used herein, a "factor" refers to any substance which a cell requires to survive and/or 
grow and/or proliferate and which can be produced and exported by another celL Such factors 
include, without Umitation, growth factors (e.g., interleukins, insulin, transferrin, hydrocortisone, 
20 fibroblast growfli factor, nerve growth factor, epidermal growth factor), amino acids, and 
vitamins. 

As used herein, "solid support^* means a surface to which a molecule (e.g. an 
oligonucleotide primer) can be irreversibly bound, including biit not limited, to menibranes, 
ssepharose beads, magnetic beads, tissue culture plates, silica based matrices, membrane based 
25 matrices, beads comprising surfaces including but not limited to. styrene, latex or silica based 
materials and other polymers for example cellulose acetate, teflon, polyvinylidene difluoride, 
^nylon, nitrocellulose, polyester, carbonate, polystdphone, metals, zeolites, paper, 2ilumiQa, glass, 
polypropylene, polyvinyl chloride, polyvinyUdene chloride, polytetrafluorethylene, polyetiiylene, 
polyamides, plastic, filter paper, dextran, germanium, silicon, (poly)tetrafluorethylene, gallium . 
30 arsenide, gallium phosphide, silicon oxide, sihcon nitrate and combinations thereof , A solid 
support according to the subject invention includes an iimer wall of a reaction tube. 
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, "Magnetic bead" means any solid support that is attracted by a magnetic field; such solid 

supports include, without limitation, Dynabeads, BioMag Streptavidin, MPG7 Streptavidin, 

Streptavidin MagnesphereJ, Streptavidin Magnetic Particles, AffiniTipJ, any of the Maga line of 

magnetizable particles, BioMag Superparamagnetic Particles, or any other magnetic bead to 

5 which a molecule (e.g. an oligonucleotide primer) may be attached or immobilized. 

As used herein, a "modulator which regulates gene expression" refers to a compound or 
condition capable of either increasing or decreasing the expression of a gene (e.g., at the level of 
transcription) as compared to the expression of the gene in the absence of the compound or 
condition. As used herein, the term "condition" refers to a normal stage, a disease stage, a 

10 disease type, or a developmental stage of an individual, or an environment to which an individual 
is exposed. Where a difference is an increase, the increase may be as much as about 20%, about 
30%, about 50%, about 70%, about 90% to about 100% (about two-fold) or more, up to and 
incliiding about 5-fold, 10-fold, 20-fold, 50-fold or more. Where a difference is a decrease, the 
. decrease may be as much as about 20%, 30%, 50%, 70%, 90%, 95%, 100% (e.g., where there is 

15 no specific protein or RNA present). The level of gene expression (e.g., at the level of 

transcription) may be measured by methods well known in the art, e.g., by Northem Blot, RT- 
PCR as described in Basic Methods in Molecular Biology, (1986, Davis et al., Elsevier, NY); 
and Current Protocols in Molecular Biology (1997, Ausubel et al., John Weley & Sons, Inc.). 
The level of gene eJ5)ression can also be detected by the subject methods as disclosed by the 

20 present invention. A "modulator" according to the present invention, also includes a dmg or a 
therapeutic agent or a potential drug as defined hereinafter. 

As used herein, the terai "poly A site" or "poly A sequence" as used herein denotes a 
DNA seiquence which directs both the termination and polyadenylation of the nascent RNA 
transcript Efficient expression of recombinant DNA sequences in eukaryotic cells requires 
25 expression of signals directing the esfficient termination and polyadenylation of the resulting 
transcript Transcription termination signals are generaUy fomd downstre 
polyadkiylation signal and are a few hundred nucleotides in length. 

As used herein, a "RNA transcript" refers to the product resulting from RNA 

polymerase-cataiyzed transcription of a DNA sequence. When the RNA transcript is a perfect 

-■■ ■ % 

30 complementary copy of the DNA sequence, it is referred to as the primary transcriiptr or it may 

be an RNA sequence derived from posttranscriptional processing of the primary transcript and is 
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referred to as the mature RNA. "Messenger RNA" (mRNA) refers to the RNA that is without 

introns and that can be translated into protein by the cell. 

The term "drug" or "therapeutic agent" includes active fragments or analogs of a drug, 
a protein or a polynucleotide, that have at least 50% of the activity of the full-sized drug. A 
5 drug can be a protein, peptide, or a polynucleotide. 

As defined herein, the "efficacy of a drug" or the "efficacy of a therapeutic agent" is 
defined as ability of the drug or tiierapeutic agent to restore the expression of diagnostic trait to 
values not significanfly different firom normal (as determined by routine statistical methods, to 
^thin 95% confidence levels). 

10 A "disease or pathology" is a change in one or more biological characteristics that 

impairs normal functioning of a cell/tissue, and/or individual. 

As used herein, the term "course of disease" or "disease stage" refers to the sequence of 
events in which a disease develops, causes symptoms and is either recovered firom or continues 
and/or increases in severity. 

15 The present invention provides a method to identify genes that are differentially 

expressed in two or more samples and to measure differences in their levels of expression. The 
present invention is based on RT-PC3R using sample-specific oligonucleotide primers so those 
amplified products are distinguishable according to their sample sources. 

The practice of the present invention will employ, unless otherwise indicated, 
20 conventional techniques of molecular biology, microbiology and recombinant DNA techniques, 
/which are within flie skill of the art: Such techniques are explained fully in the literature. See, 
"^e-g., Sambrook, Fritsch & Maniatis, 1989, Molecular Cl oning: A Laboratorv Manual Second 
Edition : QUgonucleotide Svnthesis (MJ. Gait, ed., 1984); Polynucleotide Hybridization CB.D. 
Hames & S.J. Higgins, eds., 1984); A Practical Guide to Molec ular Cloning (B. Perbal, 1984); 
25 " and a series. Methods in Enzvmologv (Academic Press, Inc.); Short Protocols In Molecular 
Biology. (Ausubel et al., eA, 19195)- The practice of the present invention may also involve 
techniques and compositions as disclosed in U.S. Patent Nos. 5,965,409; 5,665,547; 5,262,311; 
5,599,672; 5,580,726; 6,045,998; 5,994,076; 5,962,211; 6,217,731; 6,001,230; 5,963,456; 
5,246,577; 5,126,025; 5,364,521; 4,985,129. All patents, patent applications, and publications 
30 mentioned herein, both supra and infra, are hereby incorporated by reference. . ./ 
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Sample Sources 

The invention provides for a method of detecting, measuring, and comparing the 
expression of a target polynucleotide in to or more samples, as defined herein. A sample 
according to the invention may contain at least one polynucleotide or it may be a target 

5 polynucleotide itself Prior knowledge of sequence information of the polynucleotide may or 
may not be required depending on particular uses. Useful sample according to the invention 
includes, but is not limited to, a sample of a target polynucleotide (genomic DNA, cDNA or 
RNA), cell, organism, tissue, fluid, plasma, serum, spinal fluid, lymph fluid, synovial fluid, 
urine, tears, stool, extemal secretions of the skin, respiratory, intestinal and genitourinary tracts, 

10 saliva, blood cells, tumors, organs, tissue, samples of in vitro cell culture constituents, natural 
isolates (such as drinking water, seawater, solid materials,) microbial specimens, and objects or 
specimens that have been "marked" with polynucleotide tracer molecules. 

Useful samples of the present invention may be obtained from different sources, 
including, for example, but not limited to, from different individuals, different developmental 

15 stages of the same or different individuals, different disease individuals, normal individuals, 
different disease stages of the same or different individuals, individuals subjected to different 
disease treatment, individuals subjected to different environmental factors, individuals with 
predisposition to a pathology, individuals with exposure to an infectious disease (e.g., HIV)- 
Useful samples may also be obtained from in vitro cultured tissues, cells, or other polynucleotide 

20 containing sources. The cultured samples may be taken from sources including, but are not 
limited to, cultures (e.g., tissue or cells) cultured in different media and conditions (e.g., pH, 
pressure, or temperature), cultures (e.g., tissue or cells) cultured for different period of length, 
cultures (e.g., tissue or cells) treated with different factors or reagents (e.g., a drug candidate, or a 
modulator), or cultures of different types of tissue or cells: 

25 Samples can be obtained from an individual with a disease or pathological condition, 

including, but not limited to: a blood disorder, blood lipid disease, autoimmune disease, bone or 
joint disorder, a cardiovascular disorder, respiratory disease, endocrine disorder, immune 
disorder, infectious disease, muscle wasting and whole body wasting disorder, neurological 
disorders including neurodegenerative and/or neuropsychiatric diseases, skin disorder, kidney 

30 disease, scleroderma, stroke, hereditary hemorrhage telangiectasia, diabetes, disorders associated 
with diabetes (e.g., PVD), hypertension, Gaucher's disease, cystic fibrosis, sickle cell anemia, 
Uver disease, pancreatic disease, eye, ear, nose and/or throat disease, diseases ajfecting the 
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reproductive organs, gastrointestinal diseases (including diseases of the colon, diseases ot the 

spleen, appendix, gall bladder, and others) and the hke. For further discussion of human 

diseases, see Mendelian Inheritance in Man: A Catalog of Human Genes and Genetic Disorders 

by Victor A. McKusick (12th Edition (3 volume set) June 1998, Johns Hopkins University Press, 

ISBN: 0801857422), the entirety of which is incorporated herein. Preferably, samples from a 

nonnal demographically matched individual and/or from a non-disease tissue from a patient 

having the disease are used in the analysis to provide controls. 

In one aspect, the samples are tissue or cell samples obtained from normal and individiial 
human beings with a specific disease. Tissue samples can be obtained from cadavers or from 
patients who have recently died (e.g., from autopsies). Tissues also can be obtained from 
surgical specimens, pathology specimens (e.g., biopsies), from samples which represent "clinical 
^^aste" which would ordinarily be discarded from other procedures. Samples can be obtained 
from adults, children, and/or fetuses (e.g., from elective abortions or miscarriages). Cells can be 
obtained from suspensions of cells from tissues (e.g., from a suspension of minced tissue cells, 
such as from a dissected tissue), from bodily fluids (e.g., blood, plasma, sera, and the hke), from 
mucosal scrapings (e.g., such as from buccal scrapings or pap smears), and/or from other 
procedures such as bronchial lavages, amniocentesis procedures and/or leukophoresis. 

In some aspects, cells are cultured first prior to extracting RNAs for analysis- Cells from 
continuously growing cell Unes, from primary cell lines, and/or secondary cell lines, also can be 
used. 

In another aspect, the samples are tissue or cell samples obtained from normal and 
individual human beings carrying different diseases. 

^'^ In one aspect, a plurality of tissues/ceUs from a single individual are obtained, i.e., the 
samples represent the "whole body" of an individual. Preferably, samples representing *Svhple 
body" according to the invention comprise at least five different types of tissues from a single 
.individual. More preferably, samples representing "whole body" according to the invention 
comprise at least 10 or at least 15 different tissues. Tissues can be selected from tihte group 
consisting of: skin, neural tissue, cardiac tissue, Uvct tissue, stomach tissue, large intestine tissue, 
colon tissue, small intestine tissue, esophagus tissue, lung tissue, cardiac tissue, spleen tissue, 
pancreas tissue, kidney tissue, tissue from a reproductive organ(s) (male or female), adrenal 
tissue, and the like. Tissues from difierent anatomic or histological locations of a smgle organ 
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can also be obtained, e.g., such as from the cerebellum, cerebrum, and medulla, where the organ 

is the brain. Some aspects of the invention comprise samples representative of organ systems 

(i.e., comprising samples from multiple organs within an organ system), e.g., the respiratory 

system, urinary system, kidney system, cardiovascular system, digestive system, and 

reproductive system (male or female). 

In a preferred aspect, cells representing "whole body" may be obtained from tissues as . 
described above and fiirther comprise cells from a bodily fluid of the patient (e.g., from a blood 
sample). 

The samples can coriiprise a plxirality of cells from individuals sharing a trait. For 
example, the trait shared can be gender, age, pathology, predisposition to a pathology, exposure 
to an mfectious disease (e.g., HIV), kinship, death from the same disease, treatment with the . 
same drug, exposure to chemotherapy, exposure to radiotherapy, exposure to hormone ttierapy, 
exposure to surgery, exposure to the same environmental condition (e.g., such as carcinogens, 
pollutants, asbestos, TCE, perchlorate, benzene, chloroform, nicotine and the like), the same 
genetic alteration or group of alterations, expression of the same gene or sets of genes (e.g., 
samples can be from individuals sharing a common haplotype, such as a particular set of HLA 
alleles), and the like. 

Although in a preferred a^ect of the invention, the samples are derived from human 
beings, in one aspect of the invention, samples from other organisms are also used. Tn one 
aspect, the samples comprise tissues from non-human animals which provide a model of a 
disease or other pathological condition. When the samples represent specimens from an animal 
model of a chronic disease, the samples can comprise specimens representing diff^ent stages of 
the disease, e.g., such as from animals in a remission period or an exacerbation period. The 
samples can additionally, or altematively, comprise tissues from a non-human animal having the 
disease or condition which has been exposed to a therapy for treating the disease or condition 
(e.g., drugs, antibodies, protein therapies, gene therapies, antisense therapies, combinations 
thereof and the like). In some aspects, the non-hvmian animal samples can comprise at least one 
cell containing an exogenous polynucleotide (e.g., the animals can be transgenic animals, 
chimeric animals, knockout or knockin animals). ^ 

In still fiirther aspects, samples from plants can be used. Preferably, such samples 
comprise plants at different stages of their life cycle and/or comprise different tj^es of plant 
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tissues, (e.g., at least about five different plant tissues). In one aspect, samples are obtained from 

plants which comprise at least one cell containing an exogenous polynucleotide (e.g., the plant 
can be a transgenic plant). 

Isolation of mRNAs From A Sample 

5 The subject method measures and compares the expression of a gene or genes in two or 

inore samples. In one aspect of the invention, the expression of a gene or genes at the 
transcription level is measured and compared. 

RNA from two or more samples to be compared (e.g., sample A and B) are extracted and 
individually reverse-transcribed into cDNAusmg sample-specific oligonucleotide primers (e.g., 

10 see figure 7: primers 1 A and IB). 

■ ■ ■ ■ ' . ■ 

Polynucleotide comprising RNA (e.g., mRNA) can be isolated jQ?om cells and tissue 
according to methods well known in the art (Ausubel et al., supra) and described below. 

RNA inay be purified ftom tissues according to the following mefliod. Following 
removal of the tissue of interest, pieces of tissue of <2g are cut and quick frozen in liquid 

15 nitrogen, to prevent degradation of RNA. Upon the addition of a suitable volume of 

guanidinium solution (for example 20 ml guanidinium solution per 2 g of tissue), tissue samples 
are ground in a tissuemizer with two or three 10-second bursts. To prepare tissue guanidinium 
solution (1 L) 590.8 g guanidinium isothipcyanate is dissolved in proximately 400 ml DEPC- 
treated H2O. 25 ml of 2 M Tris-HCl, pH 7.5 (0.05 M final) and 20 ml NazEDTA (0.01 M final) 

20 is added, the solution is stirred overnight, tiie volume is adjusted to 950 ml, and 50 ml 2-ME is 
%dded. 

Homogenized tissue samples are subjected to centrifiigation for 10 min at 12,000 x g at 
12'C. The resulting supernatant is incubated for 2 min at 65°C in the presence of 0.1 volume of 
20% SarkosyU layered over 9 ml of a 5.7M CsCl solution (O.lg CsCl/ml), and sq»arated by 

25 ..centrifiigation overnight at i 13,000 x g at 22°C. After carefiil removal of the supernatant, the 
,,tiibe is inverted and drained. The bottom of flie tube (containing the RNA pellet) is placed in a 
50 ml plastic tube and incubated overnight (or longer) at 4''C in the presence of 3 ml tissue 
resuspension buffer (5 mM EDTA, 0.5% (v/v) Saikosyl, 5% (v/v) 2-ME) to alloiy^ complete 
resuspension of the RNA pellet. The resulting RNA solution is extracted sequentially with 

30 25:24:1 phenol/chloroform/isoamyl alcohol, followed by 24:1 chloroform/isoamyl^alcohol, 
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precipitated by the addition of 3 M sodium acetate, pH 5.2, and 2.5 volumes of 100% ethanol, 
and resuspended in DEPC water (Chirgwin et al., 1979, Biochemistry. 18: 5294). 

Alternatively, RNA is isolated from tissues according to the following single step 
protocol. The tissue of interest is prepared by homogenization in a glass teflon homogenizer in 1 
5 ml denaturing solution (4M guanidinium thiosulfate, 25 mM sodium citrate, pH 7.0, O.IM 2-ME, 
0.5% (w/v) N-laurylsarkosine) per lOOmg tissue. Following transfer of the homogenate to a 5- 
ml polypropylene tube, 0.1 ml of 2 M sodium acetate, pH 4, 1 ml water-saturated phenol, and 0.2 
ml of 49:1 chloroform/isoamyl alcohol are added sequentially. The sample is mixed after the 
addition of each component, and incubated for 15 min at 0-4**G after all components have been 

10 added. The sample is separated by centrifiigation for 20 min at 10,000 x g, 4*^0, precipitated by 
the addition of 1 ml of 100% isopropanol, incubated for 30 minutes at -20''C and pelleted by 
centrifiigation for 10 minutes at 10,000 x g, 4**C. The resulting RNA pellet is dissolved in 0.3 ml 
denaturing solution, transferred to a microfiige tube, precipitated by the addition of 0.3 ml of 
100% isopropanol for 30 minutes at -20*'C, and centrifiiged for 10 minutes at 10,000 x g at 4°C. 

15 The RNA pellet is washed in 70% ethanol, dried, and resuspended in 100-200p.l DEPC-treated 
water or DEPC-treated 0.5% SDS (Chomczynski and Sacchi, 1987 . Anal. Biochem .. 162: 156). 

Kits and reagents for isolating total RNAs are commercially available from various 
companies, for example, RNA isolation kit (Stratagene, La Lola, CA, Cat # 200345); PicoPure™ 
RNA Isolation Kit (Arcturus, Mountain View, CA, Cat # KIT0202); RNeasy Protect Mini, Midi, 
20 and Maxi Kits (Qiagen, Cat # 74124). 

In some embodiments, total RNAs are used in the subject method for subsequent 
analysis, e.g., for reverse transcription. In other embodiments, mRNAs are isolated from the 
total RNAs or directly fix)m the samples to use for reverse transcription. Kits and reagents for 
isolating mRNAs are coimriercially available from, e.g., Oligotex mRNA Kits (Qiagen, Cat # 
25 70022). 

Polynucleotides comprising RNA can be produced according to the method of in vitro 
transcription. 

The technique o f in viti'o transcription is well known to those of skill in the art. Briefly, 
the gene of interest is inserted into a vector containing an SP6, T3 or T7 promoter The vector is 
30 linearized with an appropriate restriction enzyme that digests the vector at a single site located 
downstream of the coding sequence. Following a phenol/chloroform extraction,' the DNA is 
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ethanol precipitated, washed in 70% ethanol, dried and resuspended in sterile water. The in vitro 
transcription reaction is performed by mcubating the linearized DNA with transcription buffer 
(200 mM Tris-HCl, pH 8.0, 40 mM MgCk, 10 mM spermidine, 250 NaCl [T7 or T3] or 200 mM 
Tris-HCl, pH 7.5, 30 niM MgCh, 10 mM spermidine [SP6]), dithiothreitol, RNase inhibitors, 
each of the four ribonucleoside triphosphates, and either SP6, T7 or T3 RNA polymerase for 30 
min at 37^C. To prepare a radiolabeled polynucleotide comprising RNA, unlabeled UTP will be 
omitted and ^^S- UTP will be included in the reaction mixture. The DNA template is then 
removed by incubation with DNasel. Following ethanol precipitation, an aliquot of the 
radiolabeled RNA is counted in a scintillation counter to determine the cpm/pl (Ausubel et al., 
supra). 

' ' RNAs isolated from samples are used for synthesizing cDNAs and generating amplified 
pfbducts for the detection and measurement of expression. In preferred embodiments, both 
cDNA synthesis and amplification reactions employ the use of oUgonucleotide primers. 

nftRipning Oligonucleotide Primers of The Invention 

15 Useful oUgonucleotide primers according to the invention may be designed according to 

general guidance well known in the art as described herem, as well as with specific requirement 
as described hereinafter for each steps of the subject method of the invention. 

1. General Strategies fo7' Primer Design 

Oligonucleotide primers are 5 to 100 nucleotides in length, preferably from 17 to 45 
20 nucleotides, although primers of different lengdi are of use. Primers for synthesizing cDNAs are 
preferably 1 0-45 nucleotides, while primers for amplification are preferably about 1 7-25 
Sticleotides. Primers useftd accordmg to. the invention are also designed to have a particular 
felting temperature (Tm) by the method of melting tCTiperature estimation. Commercial 
programs, including OUgo™ , Primer Design and programs available on the intemet, including 
25 ,Primer3 and Oligo Calculator can be used to calculate a Tm of a polynucleotide sequence usefiil 
according to the invention. Preferably, the Tm of an amplification primer useful according to the 
invention, as calculated for example by OUgo Calculator, is preferably between about 45 and 
65^C and more preferably between about 50 and eO^'C. 

Tm of a polynucleotide affects its hybridization to another polynucleotide (e.g., the 
30 annealing of an oligonucleotide primer to a template polynucleotide). In the subject-method of 
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the invention, it is preferred that the oligonucleotide primer used in various steps selectively 
hybridizes to a target template or polynucleotides derived from the target template (i.e., first and 
second strand cDNAs and amplified products). Typically, selective hybridization occurs when 
two polynucleotide sequences are substantially complementary (at least about 65% 
complementary over a stretch of at least 14 to 25 nucleotides, preferably at least about 75%, 
more preferably at least about 90% complementary). See Kanehisa, M., 1984, Polynucleotides 
Res. 12: 203, incorporated herem by reference. As a result, it is expected that a certain degree of 
mismatch at the priming site is tolerated. Such mismatch may be small, such as a mono-, di- or 
tri-nucleotide. Alternatively, a region of mismatch may encompass loops, which are defined as 
regions in which there exists a mismatch in an unintermpted series of four or more nucleotides. 

Nimierous factors influence ttie efficiency and selectivity of hybridization of the primer 
to a second polynucleotide molecule. These factors, which include primer length, nucleotide 
sequence and/or composition, hybridization temperature, buffer composition and potential for 
steric hindrance in the region to which the primer is required to hybridize, will be considered, 
when designing oUgonucleotide primers according to the invention. 

A positive correlation exists between primer length and both the efficiency and acciu^y 
witti which a primer will anneal to a target sequence. In particular, longer sequences have a 
higher melting temperature (Tm) than do shorter ones, and are less likely to be rq)eated within a 
given target sequence, thereby niiniinizing promiscuous hybridization. Primer sequences with a 
high G-C content or that comprise palindromic sequences tend to self-hybridize, as do their 
intended target sites, since unimolecular, rather than bimolecular, hybridization kinetics are 
generally favored in solution. However, it is also important to design a primer that contains 
sufficient numbers of G-C nucleotide pairings since each G-C pair is bound by three hydrogen 
bonds, rather than the two that are foimd when A and T bases pair to bind the target sequence, 
and therefore forms a tighter, stronger bond. Hybridization temperature varies inversely with 
primer anneaUng efficiency, as does the concentration of organic solvents, e.g. formamide, that 
migiht be included in a priming reaction or hybridization mixture, while increases in salt 
concentration facilitate binding. Under stringent annealing conditions, longer hybridization 
probes, or synthesis primers, hybridize more efficiently than do shorter ones, which are sufficient 
under more permissive conditions. Preferably, stringent hybridization is performed in a suitable 
buffer (for example, IX RT buffer, Stratagene Catalog # 600085, IX Pfii buffer, Stratagene 
Catalog #200536; or IX cloned Pfix buffer , Stratagene Catalog #200532, or other buffer suitable 
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for Other enzymes used for cDNA synthesis and ampUfication) under conditions that allow the 

polynucleotide sequence to hybridize to the oligonucleotide primers (e.g., 95''C for PGR 

amplification). Stringent hybridization conditions can vary (for example fiom salt 

concentrations of less than about IM, more usually less than about 500 mM and preferably less 

5 than about 200 mM) and hybridization temperatures can range (for example, from as low as 0**C 
to^eater than 22"C, greater than about 30"C, and (most often) in excess of about 37*'C) 
depending upon the lengths and/or the polynucleotide composition or the bUgonucleotide . 
primers. Longer fragments may require higher hybridization temperatures for specific 
hybridization. As several factors affect the stringency of hybridization, the combination of 

LO parameters is more important tlian the absolute measure of a single factor. 

' Oligonucleotide primers can be designed with these considerations in mind and 
s^thesized according to the following methods. 

2. Oligonucleotide Synthesis 

The oligonucleotide primers themselves are synthesized using techniques that are also 
15 well known in the art. Methods for preparing oligonucleotides of specific sequence are known in 
the art, and include, for example, cloning and restriction digest analysis of appropriate sequences 
and direct chemical synthesis. Once designed, oligonucleotides are prepared by a suitable 
chemical synthesis method, includmg, for example, the phosphotriester method described by 
Narang et al., 1979, Methods in Enzymology, 68:90, the phosphodiester method disclosed by 
20 Brown et al., 1979, Methods in Enzymology, 68:109, the diethylphosphoramidate method 

disclosed in Beaucage et al., 1981, Tetrahedron Letters, 22:1859, and the soUd support method 
disclosed in U.S. Patent No. 4,458,066, or by other chenaical methods using either a commercial 
:*^utoifiate.d oligonucleotide synthesizer (which is commercially available) or VLSIPS 

25 " TheoUgoiiucleotideof the subject invention may be covalently or noncovalentlyl^ 
"*directly or indirectly (e.g., through a linking moiety) to a solid support according to some 
embodiments. OUgonucleotides may be linked with the soUd phase support that they. 
*^s)iithesized on, or they may be separately synthesized and attached to a solid phase support for 
use, e.g. as disclosed by Lund et al, (1988) Polynucleotides Research, 16: 10861-10880; 
30 Albretsen et al, (1990), Anal. Biochem., 189: 40-50; Wolf et al, (1987) Polynucleotides 

Research, 15: 291 1-2926; or Ghosh et al, (1987), Polynucleotides Research, 15: 53§3-5372, U.S. 
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Patent Nos. 5,427,779, 5,512,439, 5,589,586, 5,716,854 and 6,087,102. Methods of 
immobilizing a polynucleotide sequence on a solid support are also provided by the 
manufacturers of the solid support, e.g., for membranes: Pall Corporation, Schleicher & Schuell, 
for magnetic beads; Dyal, for culture plates; Costar, Nalgenunc, and for other supports useful 
5 according to the invention, CPG, Inc. Preferably, oligonucleotides are synthesized on and used 
with the same sohd phase support, which may comprise a variety of forms and.include a variety 
of linking moieties. 

A solid substrate according to the invention is any surface to which a molecule (e.g., 
capture element) can be irreversibly boxmd, including but not limited to membranes, magnetic 

10 beads, tissue culture plates, silica based matrices, membrane based matrices, beads comprising 
surfaces including but not limited to styrene, latex or silica based materials and other polymers 
for example cellulose acetate, teflon, polyvinylidene difluoride, nylon, nitrocellulose, polyester, 
carbonate, polysulphone, metals, zeolites, paper, alumina, glass, polypropylene, polyvinyl 
chloride, polyvinylidene chloride, polytetrafluorethylene, polyethylene, polyamides, plastic, 

15 filter paper, dextran, germanium, siUcon, (poly)tetrafluorethylene, gallitim arsenide, gallixmi 
phosphide, silicon oxide, silicon nitrate and combinations thereof. Useful solid substrates 
according to the invention are also described in Sambrook et al., (1989) Molecular Cloning: A 
Laboratory Manual (2nd ed.). Vols. 1-3, Cold Spring Harbor Laboratory; Ausubel et al., supra, 
U.S. Patent Nos. 5,427,779, 5,512,439, 5,589,586, 5,716,854 and 6,087,102, Southem et al., 

20 1999, Nature Genetics Supplement 21 :5 arid Joos et al., 1997, Analvtical Biochemistrv. 247:96. 
Solid phase supports for use with the invention may have a wide variety of forms, including 
microparticles, beads, and membranes, slides, plates, n:iicromacluned chips, and the like. 

A preferred solid support of the present invention is microparticles. A wide variety of 

microparticle supports may be used with the invention, including microparticles made of 
25 controlled pore glass (CPG), highly cross-linked polystyrene, acrylic copolymers, cellulose, 

nylon, dextran, latex, polyacrolein, and the like, disclosed in the following exemplary references: 

Meth. EnzvmoL . Section A, pages 11-147, vol. 44 (Academic Press, New York, 1976); U.S. 

Patent Nos. 4,678,814; 4,413,070; and 4,046;720; andPon, Chapter 19, in Agrawal, editor, . 

Methods in Molecular Bioloev. Vol. 20, (Humana Press, Totowa, N.J., 1993). Microparticle 
30 supports further include commercially available nucleoside-derivatized CPG and polystyrene 

beads (e.g. available from Applied Biosystems, Foster City, Calif.); derivatized magnetic beads; 

polystyrene grafted with polyethyletie glycol (e.g., TentaGeLTM., R^p Polymere, Xubingen 
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Germany); and the like. Selection of tiie s\q>port characteristics, such as material, porosity, size, 
shape, and the hke. and the type of linking moiety employed depends on the conditions under 
which the oligonucleotides are used. For example, in applications involving successive 
processing with enzymes (e.g., a revise transcriptase or a DNA polymerase), supports and 
5 linkers that minimize steric hindrance of the enzymes and fliat faciUtate access to substrate are 
preferred. Other important factors to be considered in selecting ttie most appropriate 
microparticle support include size uniformity, efficiency as a synthesis support, degree to which 
surface area known, and optical properties, e.g. as explain more fully below, clear smooth beads 
provide instrumentational advantages when handling large numbers of beads on a surface. 

10 - Exemplary linking moieties for attaching and/or synthesiang oligonculeotides on 

microparticle surfaces are disclosed in, for example, Pon et al, (1988) Biotechniques, 6:768-775; 
Webb, U.S. Patent No. 4,659,774; Barany et al. International patent ^plication 
PCT/US91/06103; Brown et al, (1989) J. Chem. Soc. Commun., 1989: 891-893; Damha et al, 
(1990) Polynucleotides Research, 18: 3813-3821; Beattie et al, (1993) Clinical Chemistry, 39: 

15 719-722; Maskos and Southern, (1992) Polynucleotides Research, 20: 1679-1684; and the like. 

Another preferred solid support of the presoit invaition is an inner wall of a reaction 
tube. The reaction tube may be made of any of cellulose acetate, teflon, polyvinylidene 
difluoride, nylon, nitrocellulose, polyester, carbonate, polysulphone, metals, zeohtes, paper, 
alumina, glass, polypropylene, polyvinyl chloride, polyvinylidene chloride, 
20 polyteti^uoretiiyleaie, polyethylene, polyamides, plastic, filtea: paper, dextran, gennanium, 
sihcon, (poly)tetrafluorethylene', gallium arsenide, gallium phosphide, silicon oxide, or silicon 
nitrate. Preferably, the inner wall of a reaction tube is made of polypropyleiie. 

. . f i . Oligonucleotides may also be synthesized on a single (or a few) splid phase support to 
^form an anayofregjoiis uiiiformly coated with synthesized oligonucleotides. Techniques for 
25 synthesi2ang such arrays are disclosed in McGalletal,Interiiational application 

PCT/US93/03767; Pease et al, (1994) Proc. Natl. Acad. Sci., 91: 5022-5026; Southern and 
. Maskos, Mtemational appUcation PCT/GB89/01114; Maskos and Southern (Supra); Southern et 
al, (1992) Genomics, 13: 1008-1017; and Maskos and Southern, (1993) Polynucleotides 
Research, 21: 4663-4669. 

30 Preferably, the invention is implemented with oligonucleotides linked to micfoparticles 

or beads. Microparticle supports and mefliods of covalently or noncovaleatiy linking 

•■k-i 
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oligonucleotides to their surfaces are well known, as exemplified by the following references: 

Beaucage and Iyer (supra); Gait, editor. Oligonucleotide Synthesis: A Practical Approach (IRL 

Press, Oxford, 1984); and the references cited above. Generally, the size and shape of a 

microparticle is not critical; however, microparticles in the size range of a few, e.g. 1-2, to 

5 several hundred, e.g. 200-1000 pm diameter are preferable, as they facilitate the construction and • 

manipulation of large repertoires of oligonucleotides with minimal reagent and sample usage. . 

In some preferred embodiments, commercially available controlled-pore glass (CPG) or 
polystyrene supports are employed as solid phase supports in the invention. Such supports come 
available with base-labile linkers and initial nucleosides attached, e.g. AppUed Biosystems 
10 (Foster City, Calif). Preferably, microparticles having pore size between 500 and 5000 
angstroms aire employed. 

In other preferred embodiments, non-rporous microparticles are employed for their optical 
properties, which may be advantageously used when tracking large numbers of microparticles on 
planar supports, such as a microscope slide. Particularly preferred non-porous microparticles are 
15 the glycidal methacrylate (GMA) beads available from Bangs Laboratories (Carmel, Ind.). Such 
microparticles are usefiil in a variety of sizes and derivatized with a variety of linkage groups for 
synthesizing tags or tag complements. Preferably, for massively parallel manipulations of 
oligoncueltoides, riiicroparticles of 5 pm diameter GMA beads are employed. 

3. Oligonucleotide Primer Design Strategy for cDNA Synthesis 

20 The design of a particular oligonucleotide primer for the purpose of cDNA synthesis and 

amplification reaction of the subject method involves selepting a sequence that is capable of 
recogniziiig and annealing to the target sequence, The Tm of the oligonucleotide is optimized by 
analysis of the length and GG cdntent of the o^ 

The design of a prima: usefiil according to the invention, may be faciUtated by the use of 
25 readily available computer programs, developed to assist in the evaluation of the several 
parameters described above and the optinodzation of primer sequences. Examples of such 
programs are 'TrimerSelect" of the DNAStar™ software package (DNAStar, Inc.; Madison, 
WD, OLIGO 4.0 (National Biosciences, Inc.), PRIMER, Oligonucleotide Selection Program, 
PGEN and AmpUfy (described in Ausubel et al,, supra). 
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An oUgonucleotide primer useful according to the present invention may comprise a 
degenerate sequence consisting one or more degenerate bases, e.g., as described hereinafter. 
Such degenerate oUgonucleotides will behave as normal substrates for polynucleotide kinase. 
DNA Ugase, and other modifying enzymes (HiU, F. Loakes, D. and Brown D.M., 1998, Proc 
Natl Acad Sci US A-, 95:4258-4263). Degenerate nucleotide can be incorporated into an 
oUgonucleotide sequence at any position; i.e., 5% 3' or intemaUy. Degenerated bases are known 
in the art and diffCTent codes are used for the description of different degeneracy (e.g.. Table 1). 



10 



Table I 



Degenerate Base Codes 



Code 


Representation 


W 


Aor.T 


S. 


GorC 


M 


AorC 


K 


GbrT 


R 


AorG 


Y 


CorT 


V 


A or C or G 


H 


A or C or T 


D 


AorGorT 


B 


C or G or T 


N 


AorCorGorT 



Alternatively a degenerate base may be a nucleotide capable of base-pairing wifli at least 
two of dA, dG, dC, and dT. Such useful degenerate bases are usually nucleotide analogues and 
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are known in the art, and as described hereinafter. For example, deoxyinosine (dl) is a naturally 

occurring degenerate base because it will bind to any of the four natural DNA bases, dl, while 

not truly universal, is less destabilizing than mismatches involving the 4 standard bases (i.e.. A, 

T, G, and C). As used herein, '^mive^sal base" refers to a base that exhibits the ability to replace 

5 any of the four normal bases without significantly destabiUzing neighboring base-pair 

interactions or disrupting the expected functional biochemical utility of the modified 

oligonucleotide. Hydrogen bond interactions between dl and dA, dG, dC, and dT are weak and 

unequal, with the result that some base-pairing bias does exist with dI:dG hybridization >dI:dA> 

dI:dG> dI:dT (Kawase, Y. et al, 1986, Polynucleotides Res., 1919:7727-7736; Martin,F.H. etal,, 

10 1985, Polynucleotides Res., 13, 8927-8938; Case-Green,S. C., Southern, E.M., 1994, 

Polynucleotides Res., 22, 131-136). When present in a polynucleotide, dl preferentially directs 

incorporation of dC in the growing nascent strand by a DNA polymerase. 

More recently, non-natural bases have been engineered that fimctionally are true 
universal bases and will not destabilize a Watson-Crick DNA duplex when paired with either 

15 dA, dG, dC, or dT. The appUcations of these universal DNA base analogues have been recently 
reviewed (Loakes, 2001, Polynucleotides Res., 29: 2437-2447). Two examples are 3- 
nitropyrrole 2 -deoxynucloside and 5-nitroindole 2 -deoxynucleoside (5-nitroindole), These two 
examples above act as truly universal bases. Ottier base modifications have been synthesized 
that are more specific. Degenerate bases which base pair with two or more, but not all fom: of 

20 dA, dG, dC, and dT are also usefiil for the subject method of the invention. Examples include 
the pyrintiidine (C or T) analogue 6H,8H-3,4-dihydropyrimido[4,5-c][l,2]oxa2in-7-one, 
designated as "p", and the purine (A or G) analogue N6-methoxy-2,6-diaminopurine, designated 
as "k". The "p" base will pair with dA or dG while the "k" base will pair with dT or dC 
(Bergstrom, D. E., Zhang, P., and Johnson, W.T,, 1997, Polynucleotides Res. 25:1935-1942). 

25 For example, dPTP (dP) can behave as either thymidine (T) or deoxycytidine (dC), 

because the base can exist in either of two tautomeric forms. In the imino-form, dP has the base- 
pairing properties of thymidine and so base-pairs with dA^ whereas in the amino-fomi it mimics 
dC and base-pairs wittt dG (Sekiguchi. M.. 1996. Genes to Cells. 1. no.139-145: Pavlov> Y. et 
al., 1994, Biochemistry, 33: 4695^701), 8-oxo-dGTP base-pairs with either dC or dA 

30 (Sekiguchi, M., supra; Zaccolo, M., et al., 1996, J. Mol. BioL, 255: 589-603). 

The oligonucleotide primers of the subject invention may comprise artificial nucleotides 
as defined hereinabove in the definitions. The artificial nucleotides may be located In 5% 3' or 
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internal of an oligonucleotide primer of the subject invention. An "artificial nucleotide" may be 
used in an oUgonudeotide so as to reduce non-specific annealing and background amplification 
and to increase the specificity of polynucleotide amplification. For this purpose, it is preferred 
that the artificial nucleotide used shows a preference of base pairing with another artificial 
5 nucleotide over a conventional nucleotide (i.e.. dA. dT, dG. dC and dU). In one embodiment, 
one or more artificial nucleotide XTP (2-amino-6-(N,N-dimethylamino)purine 5'-Triphosphate) 
or -YTP (Pyridin-2-one Ribonucleoside S'-Triphosphate) are used because dXTP and dYTP 
exhibit base-pairing preference with each other over the conventional nucleotides, although a 
sUght preference for dXJIP also exists (Ohtsuki et al., supra). 

10 The oUgonucleotide prima: ofthe present invention (e.g., the first oligonucleotide 

primer) may comprise a sequence (e.g., the sample-specific sequence tag) that is GC rich at its 5' 
end (i.e., a continuous stiretch of nucleotides including the 5' terminal nucleotide) and AT rich as 
its 3' end (i.e., a continuous stretch of nucleotides including tiie 3' terminal nucleotide). The use 
of a sequence which is GC rich at 5' end and AT rich at 3' end increases the specificity of primer 

15 annealing because ATs fomi weaker base parings than GCs. Therefore the specificity of 
polynucleotide synthesis and ampUfication may be increased. 

B. The First Oliponucleotide Primer for The First Str and cPNA Synthesis 

In the subject metiiod of tiie invention, a first oUgonucleotide primer is used for tiie 
synthesis of the first sti^d cDNAs. In one embodiment, tiie first oUgonucleotide primer is also 

20 designed witii sequences that serve as templates for otiier primers to produce an amplification 
product The first oligonucleotide primer can be between 20 and 100 nucleotides m lengfli, 
preferably between 30 and 60 nucleotides in lengfli, more preferably between 30 aid 45 
Nucleotides m lengtti,; stiU more preferably between 34 and .42 nucleotides in length. 

One unique feature of the instant invention is that two or more samples can be analyzed 
25 in the same reaction mixture. For tiiis purpose, tiie origins of sample sources need to be properly 
identified. Preferably, the first oligonucleotide primer comprises a sample-specific tag. For 

■example, flie first oligonucleotide primer for synthesizing first stiand cDNAs from sample A 
^comprises a sample-specific sequence tag A; tiie first ohgonucleotide primer for syntiiesizing 

first sti^d cDNAs from sample B comprises a sample-specific sequence tag B. The 
30 employment of such first oUgonucleotide primer comprismg a sample-specific tag provides it 
mechanism on which subsequence polynucleotide syntiiesis and amphfication products can be 

■.'Kt 
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distinguished according to their sample sources. For example, cDNAs or amplified products 

from sample A would comprise sample-specific tag A, which are distinguishable from cDNAs or 

amplified products from sample B comprising sample-specific tag B. The sample -specific 

sequence tag maybe between 15 and 60 nucleotides in length, preferably, between 18 and 40 

5 nucleotides in length, more preferably, between 20 and 30 nucleotides in length, still more 

preferably, between 20 and 24 nucleotides in length. 

The sample specific sequence tag according to the invention may be a polynucleotide 
sequence (i.e., sample-specific sequence tag) or it may be any other identifiable tags known in 
the art. The sample-specific sequence tags for different first oUgonucleotides (i.e., different 
10 samples) may be different in their nucleotide sequences, or they may differ simply in length. 

The sample-specific tag (e.g., the sample-specific sequence tag) may be located at flie 5' 
terminal, or 3' terminal, or both, or in the ndddle of the first oligonucleotides (i.e., at least one 
nucleotide away from the 5' terminal nucleotide and the 3' nucleotide). In a preferred 
embodiment, the sample-specific tag is located at the 5' terminal of the first oligonucleotide 
15 primer, i.e., there is no other nucleotide on the 5' of the sample-specific sequence. 

The most majority (with the notable exception of histone roRNA) of eukaryotic roRNA 
are synthesized with a 3 '-end "polyA" tail. The poly(A) sequence is not coded in the DNA, but 
is added to the KNA in the nucleus after transcription. The addition of poly(A) is catalyzed by 
the enzyme poly(A) polymerase, which adds -200 A residues to the free 3*-OH end of the 
20 mRNA. The presence of 3 '-end poly(A) tail has an important practical consequence. The 

poly(A) region of mKNTA can base pair with oligo(U) or oligo(dT); and ttds reaction can be used 
to isolate poly(A)'*^ mRNA and to synthesize cDNA from mRNA. oligo(dT) or oligo(dU) 
sequence can be used an a primer to prime the synthesis of the first strand cDNA using reverse 
transcriptase. 

25 The first oligonucleotide primer may further comprise an oligo(dT) or oligo(dU) 

sequence. Preferably, the oligo(dT) or oligo(dU) sequence is located 3' of the sample-specific 
sequence. The oligo(dT) or oUgo(dU) sequence is at least 5 nucleotide in length and may be 
between 5 and 20 nucleotides in length, preferably between 8 and 18 nucleotides in length, more . 
preferably between 12 and 16 nucleotides in lengtH ^ 

30 In one embodiment, a sample-specific sequence tag comprises a general structure of 

about 20 to 24 nucleotides at the 5'-terminal of the first oligonucleotide primerr^In a preferred 
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embodiment, this general structure of about 20 nucleotides is foUowed by an oligo(dT) or 
oUgo(dU) stretch (12-16 residues) at its at 3' end. In a preferred embodiment, the oUgo(dT) or 
oligo(dU) stretch is immediately 3' of the sample-specific sequence. However, there may be a 
non-sample-specific sequence, i.e., a common sequence for both sample A and sample B (e.g., at 
5 least one nucleotide or at least 2, or 3, or 5, or 6, or 10, or up to 20 nucleotides) between the 
sample-specific tag and the oligo(dT) or oUgo(dU) stretch. 

u There is one potential problem associated with the use of oligo(dT) or oligo(dU) in 
cDNA synthesis. Since the polyA tail can be quite long, simply using an oUgo(dT) or oUgo(dU) 
may not accurately initiate reverse transcription right before the non-polyA region. In fact, since 

1 0 the oUgo dT may randomly anneal to any stretch of polyA sequences, the end product of reverse 
transcription from even a single template mRNA can result in a heterogeneous population of 1st 
strand cDNA, each with a different length of polyT at the 5'-end. To overcome this problem, 
two more deoxynucleotides, e.g., VN. can be added to the 3'-end of the oUgo(dT) or oUgo(dU) 
primer, wherein V is any dNTP but dTTP and N is any of the four dNtPs. That way, such 

1 5 primer wiU stably anneal at the junction of the polyA tail and the non-tail region, thus ensuring 
uniform size of the obtained first strand cDNA synthesized firom a given template. In that sense, 
the primer used for the first strand cDNA synthesis is in fact a degenerate oligonucleotide (Smith 
et al., 1997, Biotechniques 23: 274-279). 

In one embodiment, the 3' terminal of the first primer further contains a degenerate 
20 sequence, i.e., a sequence comprising more than one nucleotide compositiotL The first 

oUgonucleotide primer may comprise a degenerate sequence of any length, preferably less than 5 
nucleotides, more preferably 2 nucleotides. In one embodiment, the degenerate sequence in the 
"'first oUgonucleotide primer is VN, where V is dA, dC or dG and N is dA, dT (or dU), dC or dG. 
: fbx B. preferred embodiment, the first oligonucleotide primer comprises a composition Of: 
25 5'(sample-specific sequence tag)2o-24(dT)i2.i6VN3'. hi another embodiment, the furst 

oligonucleotide primer comprises a composition of: 5 '(sample-specific sequence tag)2o.24(dU)i2- 
i.i6VN3' 

The oligo(dT) or oligo(dU) stretch on the first oligonucleotide primer is annealed to 
' complimentary (polyA)-tailed mRNAs in each sample to aiable priming of first stirand cDNA 
30 synthesis. The degenerate nucleotides faciUtates tiie anneaUng of oUgo(dT) or oUgo(dU)and the 
efficiency of first strand cDNA synthesis. The primer-specific sequence tag is unique for each of 
the two samples and provides identification of the origin of the cDNA. ^ / 
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The use of degenerate base may result in a mixture of first oligonucleotide primers tor me 
first strand cDNA synthesis. For example, in one embodiment the reverse transcription is 
conducted with a mixture of specific primers for each sample. These primers have the following 
structure: 5*-(specific sequence tag A)2o-24Ti2-i6AN-3', 5'-(specific sequence tag A) 20-24T12- 
5 16CN-3*, 5'-(specific sequence tag A) 20-24T12-16GN-3' (N is a degenerated base which includes a 
mixture of A, T, C, G) for sample A; and 5'-(specific sequence tag B)2o-24Ti2-i6AISF-3', 5 *- 
(specific sequence tag B)2o-24Ti2-i6CN-3', 5*-(specific sequence tag B)20-24Ti2-i6GN-3' for 
sample B. The sample specific sequence tag need not be identical for each primer in the mixture. 
For example, in one embodiment the reverse transcription is conducted with a mixture of specific 
10 primers for each sample. These primers have the following structure: 5 '-(specific sequence tag 
Al)2o-24Ti2-i6AN-3% 5'-(specific sequence tag A2)2o-24Ti2-i6CN-3% 5'-(specific sequence tag 
A3)2o-24Ti2-i6GN-3' (N is a degenerated base which includes a mixture of A, T, C, G) for sample 
A; and 5'-(specific sequence tag Bl)2o-24Ti2-i6^^-3', 5 '-(specific sequence tag B2)2o-24Ti2- 
16CN-3', 5*-(specific sequence tag B3)2o-24Ti2-i6GN-3' for sample B. 

15 Other nucleotide tags known in the art may be also used as sample-specific tags in the 

subjection invention, for example, as disclosed in Church et al, (1988, Science, 240: 185-188), 
DoUinger, (1994, pages 265-274 in MuUis et al, editors. The Polymerase Chain Reaction, 
Birkhauser, Boston,), Brenner and Lemer, (1992, Proc. Natl. Acad. Sci., 89: 5381-5383), Alper, 
(1994, Science, 264: 1399-1401), Needels et al, (1993, Proc. Natl. Acad. Sci., 90: 10700-10704) 

20 and U.S. Patent Nos. 6,280,935, 6,172,218, 6,150,5 16, 5,846,719, 6,172,214, 6,235,475, aU 

incorporated herein by references. The above patents disclose methods of tracking, identifying, 
and/or sorting classes or subpopulations of molecules by the use of oligonucleotide tags. The 
oligonucleotide tags conqprising oligonucleotides selected firom a minimally cross-hybridizing 
set can be used for sorting polynucleotides by specifically hybridizing tags attached to the 

25 . polynucleotides to their complenients on solid phase supports. Such oligonucleotides each 
. consist of a plurality of subunits 3 to 9 nucleotides in length, A subunit of a minimally cross- 
hybridizing set fomis a duplex or triplex having two or more mismatches with the complement 
of any other subunit of ttie same set The number of oligonucleotide tags available in a particular 
embodiment depends on the number of subunits per tag and on the length of the subunit 

30 Another useftd nucleotide tag is disclosed by U.S. Patent No. 6,013,445 (incoipomted herein by 
reference) which provides a method of polynucleotide sequence analysis based oiSihe, Ugation of 
one or more sets of encoded adaptors to the terminus of a target polynucleotide. Encoded 
adaptors whose protruding strands form perfectly matched duplexes with the complementary 
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auouLu^ ux ulc uu'gci poiynucieouae are iigated, and the identity of the nucleotides in 

the protruding strands is determined by an oligonucleotide tag carried by the encoded adaptor. 

In a preferred embodiment, the first oligonucleotide primer is covalently linked to a solid 
support as described above herein. In this case the reverse transcription reaction generates first 
5 strand cDNAs permanently bound to the support, which allows re-using these first strand cDNAs 
for multiple reactions and easy separation of synthesized second strand cDNAs firona the first 
strand cDNAs. Preferably the 5' of the first oligonucleotide primer is linked to the solid support. 

In another preferred embodiment, the first oligonucleotide primer is synthesized in a 
solution wdthout attaching to a soUd support. 

10 C. The Second Oligonucleotide Primer for The Second Strand cDNA Synthesis 

The subject method of the invention may comprise a second strand cDNA synthesis using 
a second oligonucleotide primer after generating the first strand cDNAs, In this case, the 
synthesized second strand cDNAs or the double strand cDNAs are used as template for 
subsequence amplification. Altematively, the synthesized first strand cDNAs may be directly 
15 used as templates for amplification with synthesizing the second strand cDNAs. 

In one embodiment, the second oligonucleotide primer is also designed with sequences 
that serve as templates for other primers to produce an amplification product. The second 
oligonucleotide primer can be between 20 and 100 nucleotides in length, preferably between 17 
and 60 nucleotides in length, more preferably between 20 and 45 nucleotides in lengtti, still more 

20 preferably between 20 and 25 nucleotides in length. Preferably, the second oligonucleotide 
primer comprises a first arbitrary sequence tag. Also preferably, the second oligonucleotide 
primer for one sample (e.g., sample A) contains fhe.same first arbitrary sequence tag as the 
;£econd oligonucleotide primer for another sample (e.g., sample B). Because of the same first 
arbitrary sequence tag in second oligonucleotide primers used to synthesize second strand 

25 cDNAs firom different samples, a common amplification oligonucleotide primer (e.g., tiie thkd 
*^bligonucleotide primer as described herein after) may be used for tiie amplification of cDNAs 
derived from different samples. . 

The first arbitrary sequeiice tag may be located at the 5', or 3' terminal, or internal (i.e., at 
least one nucleotide away Scorn the 5' terminal nucleotide and the 3* nucleotide) of the second 
30 oligonucleotide primer. Preferably, the first arbitrary sequence tag is located at the 5' terminal, 
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I.e., uxcic i:i uu uiner nucieonae onxne y ot the arbitrary sequence, of the second primer for 
second strand cDNA synthesis. The first arbitrary sequence may be between 5 and 30 
nucleotides in length. 

The second oligonucleotide primer may fiirther comprise a second sequence which is 
5 complementary to a subset (i.e., a plurality) of the first strand cDNAs so as to peraiit the 

synthesis of two or more different second strand cDNAs firom a sample. Preferably the second 
sequence is a short sequence, e.g., less than 25 nucleotides in length, preferably less than 20 
nucleotides in Iragth, more preferably less than 15 nucleotides in length, still more preferably 
less than 10 nucleotides in length, so as to pemiit its annealing to a subset of first cDNAs 
10 synthesized firom a sample. In one embodiment, the second sequence of the second 

oligonucleotide primer is 6-7 nucleotides in length. In another embodiment, the second sequence 
comprises a randomly selected sequence (e.g., 6-7 base) at the 3 '-end so that a subset of cDNAs 
are synthesized from genes (i.e., first strand cDNA) comprising a complementary sequence to the 
second sequence. 

^ 5 In general, the 3 '-end of the second oligonucleotide primer is of great importance since 

there has to be a perfect or near perfect match at the 3 '-end for a polymerase to extend from the 
primer. Preferably, the second sequence is located 3' of the first arbitrary sequence. In one 
embodiment, the second sequence is located immediately 3' of the first arbitrary sequence, i.e., 
there is no other nucleotide sequence between the second sequence and the first arbitrary 

20 sequence. 

In a preferred embodiment, there is a third sequence located between the first arbitrary 
sequence and flie second sequence. Preferably, the third sequence contains one or more 
degenerate nucleotides as described above herein. The third sequence may be between 1 and 1 5 
nucleotides in length, preferably between 1 and 10 nucleotides in length, more preferably 

25 between 2 and 6 nucleotides in length. In one embodiment, the third sequence located between 
the first arbitrary sequence and the second sequence is 4 nucleotides in length (e.g., Z4 in figure 
7). The third sequence may contain all degenerate nucleotides, or it may contain a sequeiice of 
degenerated nucleotides and nondegenerate nucleotides. The degenerated nucleotide in the third 
sequence may be any of dA, dT, dG, and dC, or it may be a nucleotide capable of base pairing 

30 with two or more of dA, dT, dG, and dC. In a preferred embodiment, the third sequence contains 
four degenerated nucleotides, each of which is capable of base pairing with two or more of dA, 
dT, dG, and dC. In a more preferred embodiment the degenerate nucleotide is dl^or 
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nitropyirole. One purpose of including degenerate nucleotide is to increase the overall stabiUty 
of the primer. It has been known that DNA polymerase will be able to read through dTTP 
templates and randomly incorporate any of the four dNTPs when using such a dTTP as template 
inPCR. 

5 The selection of the second and third sequences determines what specific subsets of 

genes from which cDNAs are to be synthesized and amplified. By varying the second and/or 
third sequence, not only the size of the synthesized /amplified products can be adjusted, but also 
the specific gene families to be amplified can be selected. For example, small G proteins all 
have the signature motif of GxGxxG, wherein G is Glycine and X is any amino acid. By using 

10 degenerate oligonucleotides and matching this signature motif, expression profiles of aU smaU G 
pfotemscanbestiidied. Similarly many protein families, such as kinase, phosphatase, has . 
signature motifs and many functional domains or motifs have signatiire sequences (zinc finger, 
etc). These motifs or signature sequences are well documented and tiiere are searchable firee 
databases containing detailed description of these motife/signahore sequences. For example, the 

15 GCG Wisconsin Package sequence analysis tools developed by Accelrys (part of it is formerly 
GCG) offers such a motif search and description, the entire contents of which are hereby ' . 
incorporated by reference. 

In one embodiment, the second oligonucleotide comprises a gaieral structure of 5'(first . 
arbiti-ary sequence)io-i2(third sequence)4(second sequence)6-7 3'. The use of degenerate.base . 
20 (e.g , any of dA, dT, dG, and dC) may result in a mixture of second oUgonucleotide primers for 
the second strand cDNA synthesis. 

The second oligonucleotide primer may or may not be linked to a solid siq)pi6rt as 
fdescribed above herein. In a preferred embodiment, the second oUgonucleotide is not linked to a 
Md support but the first oUgonucleotide is so as to aUow easy separation of synthesized second 
25 strand cDNAs fiom flie first sti^d cDNAs which are Unked to the soUd support after synthesis. 

To increase the specificity and to reduce the background of tiie cDNA synthesis and 
^ampUfication reaction, when designing the first arbitrary sequaace of tiie second oUgonucleotide 
primer, it is preferred fliat its sequence does not demonstiiate significant matches to sequences in 

any mammaUan genomic sequences in GeiiBank database or otiier available databases. By 
30 "significant match", it means that tiiere is less than 30% sequaice identity (e.g., less tiian 20%, 
or less tiian 10%, or less tiian 5%, or less tiian 2% sequence idmtity) between the first arbitrary 
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sequence and any sequence of a species, e.g., human or all mammals, available in the GenBank 

database or other available databases. In some embodiments, where the sample sources are 

known, e.g., from a particular species such as human, dog, or other animals or plants, it is 

preferred that the first arbitrary sequence does not demonstrate significant matches to sequences 

S in the genomic sequences for that particular species in GenBank database or other available 

databases. 

D. T.flhe1ir^p; of ohgonucleotide primers 

The oligonucleotide primer of the present invention may be labeled, as described below, 
by incorporating moieties detectable by spectroscopic, photochemical, biochemical, 
10 immunochemical, enzymatic or chemical means. The method of linking or conjugating the label 
to the oligonucleotide primer depends, of course, on the type of label(s) used and the position of 
the label on the primer. A primer that is useful according to the invention can be labeled at the 5* 
end, the 3* end or labeled throughout the length of the primer. 

A variety of labels that would be ^propriate for use in the invention, as well as methods 
1 5 for their inclusion in the primer, are known in the art and include, but are not limited to, enzymes 
(e.g., alkaline phosphatase and horseradish peroxidase) and enzyme substrates, radioactive 
atoms, fluorescent dyes, chromophores, chemiluminescent labels, electrochemiluminesc^t 
labels, such as Origen™ (Igen), that may interact with each other to enhance, alter, or diminish a 
signal. Of course, if a labeled molecule is used in a PGR based assay carried out using a thermal 
20 cycler instrument, the label must be able to survive the temperature cycling required in this 
automated process. 

Fluorbphores for use as labels in constructing labeled priiners of the invention include 
rhodamine and derivatives (such as Texas Red), fluorescein and derivatives (such as 5- 
bromomethyl fluorescein), Lucifer Yellow, lAEDANS, 7-MeiN-coumarin-4-acetate, 7-OH-4- 
25 CH3-coumarin-3-acetate, 7-NH2-4-CH3-coumarin-3-acetate (AMCA), monobromobimane, 
pyrene trisulfonates, such as Cascade Blue, and monobromorimethyl-ammoniobimane. In 
general, fluorophores with wide Stokes shifts are preferred, to allow using fluorimeters with 

filters rather than a monochromometer and to increase the efficiency of detection. 

"V" 

The labels may be attached to the oligonucleotide directly or indirectly by a variety of 
30 techniques. Depending on the precise type of label or tag used, the label can be located at the 5* 
end of the primer or located internally in the primer, or attached to spacer amis^of various sizes 
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and compositions to facilitate signal interactions. Using commercially available 
phosphoramidite reagents, one can produce oligomers containing functional groups (e.g., thiols 
or primary amines) at the 5*- terminus via an appropriately protected phosphoramidite, and can 
label them using protocols described in, for example, PGR Protoc ols: A Guide to Methods and 
5 Applications, Innis et al., eds. Academic Press, Ind., 1990- 

Mefliods for introducing oligonucleotide functionalizing reagents to introduce one or 
more sulfhydryl, amino or hydroxyl moieties into the oligonucleotide primer sequence, typically 
at the 5' temiinus, are described m U.S. Patent No. 4,914,210. A 5' phosphate group can be 
introduced as a radioisotope by using polynucleotide kinase and ganmia- P-ATP or ganuna- P- 
10 to provide a reporter group. Biotin can be added to the 5' end by reacting an 

a&inothymidine residue, or a 6-amino hexyl residue, introduced during synthesis, with an N- 
liydroxysuccinimide ester of biotin. 

Synthesis of cDNAs 

cDNAs may be prepared and used for amplification according to the subject method of 
15 ttie invention. In some embodiments, first strand cDNAs are prepared and used directly for 
subsequence amplification reaction and analysis. In preferred embodiments of the invention, 
both first and second cDNAs are synthesized. The synthesized fiarst and second strand cDNAs 
may be used for subsequent amplification reactions or the second strand cDNAs may be 
separated fi*om the first strand cDNAs and used for ampUfication reactions. 

20 The preparation of cDNA is well-known and well-documented in the art (e.g., Ausubel et 

al., siipm) and as described below. 

' cDNA may be prepared according to the following method. Total cellulsu: RNA is 
^kolated (as described) and passed through a column of ohgo(dT) or oligo(dU) -cellulose to 
isolate polyA RNA The bound polyA mRNAs are eluted firom tiie column with a low ionic 
25 'strength buffer. To produce cDNA molecules, first oligonucleotide primers comprising 

*' bUgo(dr)n or oligo(dU)n as described above herein (wh^e n is preferably 12-16 nucleotides in 

length) are hybridized to the polyA tails to be used as primers for reverse transcriptase, an 
' "enzyme that uses RNA as a template for DNA synthesis. Alternatively, mKNA species are 
primed from many positions by using short oligonucleotide fragments comprising numerous 
30 sequences complementary to the mRNA of interest as primers for cDNA synthesis. The 

resultant RNA-DNA hybrid (i.e., RNA and first strand cDNA) is converted to^^ double stranded 
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DNA molecule (i.e., first and second strand cDNA) by a variety of enzymatic steps well-known 

in the art (Watson et al., 1992, Recombinant DNA> 2nd edition. Scientific American Books, New 

York). 

In one aspect of the invention, the first strand cDNAs are synthesized using an 
5 oligonucleotide primer comprising a sample-specific sequence, so tiiat the synthesized first 
strand cDNAs are identifiable as to their sample sources. The oligonucleotide primer used for 
first strand cDNA synthesis, therefore, comprises at least an oligd(dT) or oligo(dU) sequence and 
a sample-specific sequence. 

In one embodiment, the fibrst strand cDNAs for sample A are synthesized using inRNAs 
10 isolated firom sample A and an oligonucleotide primer comprising a sample A-specific sequence. 
The first strand cDNAs for sample B are synthesized using mRNAs isolated fix>m sample B and 
an oligonucleotide primer comprising a sample B-specific sequence. The sample A-specific 
sequence may be different firom the sample B-specific sequence by comprising different 
nucleotide identities and/or it may be different from sample B-specific sequence by comprising a 
1 5 different length of nucleotides. 

In a preferred embodimmt, the first oligonucleotide primer is linked to a solid support, 
for example, on beads via covalent links. This is advantageous since once synthesized on beads, 
these first strand cDNA can be easily washed and purified away firom excessive reagents so that 
a direct use of these beads in a separate reaction is possible. Secondly, after second strand 
20 cDNA synthesis (see below), these bound first strand cDNAs can be separated firom the second 
strand cDNAs by denaturing the double strand DNA so that they can be used for other related or 
unrelated experiments, firo example the separated second strand cDNAs can be amplified by 
subsequent amplification reaction. 

Preferably, the synthesis of the first strand cDNA is a reverse transcription reaction. The 
25 first strand cDNA is prq)ared by contacting the RNA sample with the first oligonucleotide 
primer and requisite reagents under conditions sufficient for reverse transcription of the RNA 
template in the sample. Requisite reagents contacted with the primers and RNAs are known to 
those of skill in the art and will generally include at least an enzyme having reverse transcriptase 
activity and dNTPs in an appropriate buffer mediimi. > 

30 A variety of enzymes, usually DNA polymerases, possessing reverse transcriptase 

activity can be used for the first strand cDNA synthesis step. Examples of suitable DNA 
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polymerases include the DNA polymerases derived from organisms selected from the group 
consisting of a thermophilic bacteria and archaebacteria, retroviruses, yeasts, Neurosporas, 
Drosophilas, primates and rodents. Preferably, the DNA polymerase will be selected from the 
group consisting of Moloney murine leukemia virus QA-MLV) as described in U.S. Patent No. 

5 4,943,53 1 and M-MLV reverse transciptase lacking RNaseH activity as described in U.S. Patent 
No: 5,405,776 (the disclosures of which patents are herein incorporated by reference). Avian 
myeloblastosis virus (AMV), human T-cell leukemia vims type I (HTLV-I), bovine leukemia 
viHis (BLV), Rous sarcoma virus (RSV), human immunodeficiency virus (HIV) and Tliermus 
aquaticus (Taq) or Thermus thermophilus (Tth) as described in U.S. Patent No. 5,322,770, the 

10 disclosure of which is herein incorporated by reference, as well as BcaBEST™ DNA 
Polymerase as described in U.S. Patent No. 5,436,149 (the disclosure of which is herein 
incorporated by reference). Suitable DNA polymerases possessing reverse transcriptase activity 
may be isolated from an organism, obtdned commercially or obtained from cells which express 
high levels of cloned genes encoding the polymerases by methods known to those of skill in the 

15 art, where the particular manner of obtaining the polymerase will be chosen based primarily on 
factors such as convenience, cost, availability and the like. 

The various dNTPs and buffer medium necessary for first strand cDNA synthesis through 
reverse transcription of the primed RNAs may be purchased commercially from various sources, 
where such sources include Clontech, Sigma, Life Technologies, Amersham, Boehringer- 
20 Mannheun. Biiffer mediums suitable for first strand synthesis will usually comprise buffering 

agents, usually in a concentration rangmg fix)m 10 to 100 ixM which typically support a pH in the 
range 6 to 9, such as Tris-HCl, HEPES-KOH, etc.; salts containing monovalent ions, such as 
KCl, NaCl, etc., at concentrations ranging from 0-200 roM; salts containing divalent cations like 
; MgCLsub.2, Mg(OAc) etc, at concentrations usually ranging from 1 to 10 mM; and additional 
25 reagents such as reducing agents, e.g. DDT, detergents, albumin, polyalcohols (glycerol) and the 
like. The conditions of the reagent mixture will be selected to promote efficient first strand 
synthesis. Typically the primer will first be combined with the KNA sample at an elevated 
fempCTature, usually ranging from 50 to 95 "^C, followed by a reduction in temperature to a range 
betwem about 0 to 60 °C, to ensure specific annealing of the primers to their corresponding 
30 ^KNAs in the sample. Following this annealing step, the primed RNAs are then combined with 
dNTPs and reverse transcriptase under conditions sufficient to promote reverse transcription and 
first strand cDNA synthesis of the primed RNAs. By usmg appropriate types of reagents, all of 
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the reagents can be combined at once if the activity of the polymerase can be postponed or timed 

to start after annealing of the primer to the RNA. 

In some embodiments, the first strand cDNAs are used as template for the synthesis of 
second strand cDNAs. 

5 Optionally, RNAs may be removed before the synthesis of second strand cDNA by either 

RNase H digestion or by treatment with 0. 1 - IM NaOH. 

Since the expression profile of any given sample can be quite compUcated and the 
resolution of any system is limited to certain extend, it is beneficial to selectively ampUfy a 
subset, rather than the whole set, of expressed gene. This can also be useful since in certain 

10 situations, only a given subset of genes might be of interest and it will be beneficial to filter out 
oth^ genes not of interest to improve signal-to-noise ratio. If, however, an analysis of the 
complete genome is desired, multiple runs using different primer sets can be easily achieved if 
the first strand cDNA is bound to a solid support (see above). Therefore, the identity of the 
second strand primer (i.e., tiie second oligonucleotide primer) will define which subset of 

15 expressed genes gets amplified. 

The second strand cDNA is annealed to the first strand cDNA and forms a complete 
double stranded DNA copy of the original niRNA. 

The composition of second oligonucleotide primer defines the subset of all expressed 
genes that will be synthesized. In general, the 3 *-end of the primer as described herein above, 
20 which is most important for DNA polymerase priming, contains a short sequence (for example, a 
6-7 bp sequence) which serves to select the cDNA molecules to be synthesized. 

The occurrence of such short 3 '-end priming sequences in expressed portion of 
mammalian genome can be estimated. For example, if a 6 bp palindromic sequence is used, 
depending on particular sequence used, about 5,000-10,000 occurrence are expected using 

25 current mouse sequence data (e.g. 1 .5x10^ bases in sequenced mouse.cDNAs). Since particular 
cells and tissues express only a portion of all genes in the total genome (10-30%), and because 
imder commonly used PGR conditions transcripts longer than 2,000 bases are unlikely to be . 
amplified, 500-2,000 individual transcripts are expected to be detected. It is estimated that in a 
single reaction using such primers, about 5-10% of the expressed genome (5,000 / 1.5x10* 

30 (frequency) X 2,000 (size of amplifiable firagment) X 100 = 6.7%) can be covered. Therefore, it 
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is anticipated that about 20 separate reactions (using the same genetic sample) should cover a 
considerable portion of all transcribed sequences in a single manunalian genome. 

Certain bases not commonly found in natural polynucleotides may be included in the 
polynucleotides of the present invention and include, for example, inosine and 7-deazaguanine. 
5 Complementarity need not be perfect; stable duplexes may contain mismatched base pairs or 
unmatched bases. Those skilled in the art of polynucleotide technology can detemiine duplex 
stability empirically considering a number of variables mcluding, for example, the length of the 
oligonucleotide, base composition and sequence of the oligonucleotide, ionic strength, and 
incidence of mismatched base pairs. 

10 Stability of a polynucleotide duplex is measured by the melting temperature, or "Tm". 

The Tm of a particular polynucleotide duplex under specified conditions is the temperature at 
which half of the base pairs have disassociated. The melting temperature of a double strand 
DNA molecule depends markedly on its base composition. DNA molecules rich in GC base 
pairs have a higher Tm than those having an abundance of AT base pairs. The dependence of Tm 

15 on base composition is linear, increasing about 0.4°C for every percent increase in G-C content. 
GC base pairs are more stable than AT pairs because thek bases are held together by three 
hydrogen bonds rather than by two. In addition, adjacent GC base pairs interact naore strongly 
with one another than do adjacent AT base pairs. Hence, the AT-rich regions of DNA are the 
easier to melt. 

20 A major effect on Tm is exited by the ionic strength of the solution. The Tm increases 

le.e^'C for every tenfold increase in monovalent cation concentration. The most commonly used 
icondition is to perfoim manipulations of DNA in 0,12 M phosphate buffer, which provides a 
monovalent Na^ concentration of d.l8M, and a Tm of the order of 90^C. The Tm can be greatly 
Varied by performing tiie reaction in the presence of reagents, such as formamide, that destabilize 
25 hydrogen bonds. This allows the Tm to be reduced to as low as 40X with tiie advantage that the 
DNA does not suffer damage (such as strand breakage) that can result firom exposure to high 
^WpRrahires (S;tryer. Biochemistry. 1998, 3"^ Edition. W.H. Freeman and Co., pp.81-82 and 
Lewin, Genes II, 1985. John Wiley & Sons, p.63-64). 

The synthesized second strand cDNAs can be optionally separated fix>m the first strand 
30 cDNAs synthesized linked to a solid support. The bound first strand cDNAs can then be isolated 
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and used later in other reactions. Alternatively, these bound double strand cDNAs can be used 

directly in subsequent PGR. 

In one embodiment, the newly synthesized second strand cDNAs is separated from the 
bound first strand cDNAs by, for example, exposing cDNAs to a denaturing temperature, i.e, a 
temperature higher than Tm. The bound first strand cDNAs can then be reused for further 
analysis, by using a different oligonucleotide primer to generate a new pool of second strand 
cDNAs for analysis. 

Alternatively the specific pool of cDNA firagments can be generated by enzymatic 
digestion of the double stranded DNA by the action of an appropriate restriction enzyme (e.g. 
recognizing the introduced palindromic site) and by the ligation of ttie specific adapter which 
contains a specific sequence "C" and 5 ' end and a siagle-stranded overhand compatible with the 
overhand generated by the restriction enzyme. 

Amplification 

Synthesized cDNAs (e.g., first strand, or second strand or double strand) are used to 
generate amplified products for analysis. In the subject invention, PGR amplification is 
preferred although other amplification methods known in the art can ialso be used (e.g., LGR, and 
NSBA). 

PGR methods are well-known to those skilled in the art, such as described in Mullis and 
Faloona, 1987, Methods Enzymol, 155: 335, Saiki et al., 1985, Science 230:1350, and U.S. 
Patent Nos. 4,683,202, 4,683,195 and 4,800,159, herein incorporated by reference. In its 
siinplest fonn, PGR is an f/i v//ro method for the enz^ 

sequences, using two oligonucleotide prirners that hybridize to opposite strands and flank the 
region of interest in ttie target DNA A repetitive series of reaction steps involving template 
denaturation, primer annealing and the extension of the annealed primers by DNA polymoBse 
results in the exponential accumulation of a specific firagment whose termini are defined by tiie 5' 
ends of the primers. PGR is reported to be capable of producing a selective enrichment of a 
specific DNA sequence by a factor of 10^ 

In the present invention, PGR is performed using template DNA, i.e., cDN^ (at least Ifg; 
more usefiiUy, 1-1000 ng) and at least 25 pmol of oUgonucleotide primers (i.e., the third and 
fourth oligonucleotide primer). For exanq)le, a typical reaction mixture includgg: 1-lOOOpg of 
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cDNA, 25-100 pmol of oligonucleotide primer, 2.5-10 jil of a suitable lOxbuffer, 0.4-2 \i\ of 10 
fxM dNTP, 2.5 units of Taq DNA polymerase and deionized water to a total volume of 25-100 iiL 
Mineral oil may be overlaid and the PGR is performed using a programmable thermal cycler. 

Preferably, the third oligonucleotide primer comprises the sample-specific sequence of 
5 the first oligonucleotide primer. In a preferred embodiment, the third oligonucleotide primer 
comprises the whole or a portion of the sample-specific sequence arid is capable of armealing to 
its complementary sequence (i.e., in second cDNAs). This embodiment preferably also 
employees a fourth oligonucleotide primer (i.e., with opposite orientation to the third 
oligonucleotide primer). Preferably, this fourth oligonucleotide primer comprises the first 
10 arbitrary sequence of the second oligonucleotide primer. If the second strand cDNAs of different 
saniples are synthesized using the same second oligonucleotide, the same fourth oligonucleotide 
priSier may be used to the amplification of the cDNAs by PGR. 

The use of the third oligonucleotide primer comprising a sample-specific sequence 
ensures the amplified products can be identified according to their sample origins without losing 
15 track of their identity. 

The length and temperature of each step of a PGR cycle, as well as the number of cycles, 
are adjusted according to the stringency requirements in effect Armealing temperature and 
timing are determined both by the efficiency with which a primer is expected to aimeal to a 
template and file degree of mismatch that is to be tolerated. The ability to optimize the 

20 stringency of primer annealing conditions is well within the knowledge of one of moderate skill 
in the art. An annealing temperature ofbetween30*C and 72*C is used. Initial denaturation of 
the-template molecules nomially occurs at between 92**C and SP^'C for 4 minutes, followed by 
20,-40 cycles consisting of denaturation (94-99"C for 1 5 seconds to 1 minute), annealing 
(temparature determined as discussed above; 1-2 rniriutes), and extension (72*C for 1-3 minutes). 

25 Preferably, the amplified products are labeled with detectable labels so that their identity and 
abundance may be detected. Detectable labels as defined herein above (e.g., fluorescent, 
radioactive, or colorimetric labels) maiy be linked to the amplified products by various means. 
For example, a dNTP may be labeled which leads to the labeling of an amplified polynucleotide 
once the dNTP is incorporated into the Polynucleotide. Alternatively, a primer used for 

30 amplification may be labeled which also leads to the labeling of an amplified Polynucleotide. In 
addition, a labeled probe (e.g., an oligonucleotide complementary to an amplified product) may 
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be used to hybridize to the amplified product therefore generating a detectable signal with the 

amplified product (i.e., in a hybridization assay). 

In a preferred embodiment, the 5'-end of each sample-specific PGR primer (i.e., the third 
oligonucleotide primer) is linked to a specific fluorescent label so that the lineage of the PGR 
5 product can be easily traced by their fluorescent marker according to the sample origin. In 

addition, the strength of the fluorescent signal is directly proportional to the amount of the PGR . , 
product. By recording the fluorescent strengths of a given product, a ratio between PGR 
products of different origin can be obtained. 

Althougih each sample preferably has its specific fluorescent label, the same fluorescent 
10 label can be used by more than one sample. For example, if the fluorescin-tagged third 

oligonucleotide primer for sample A is 1 base shorter (or longer) than that of sample B, and if ttie 
separation means is sensitive enough to detect 1 bp difference, then the "same" PGR firagment 
originating firom these two samples will be resolved as two close peaks differing in size by Ibp 
(e.g., by denaturing high performance liquid chromatography (DHPLG)). The same recording 
15 arid calculation can then be effectuated if the size difference is accounted for. The same strategy 
may be useful for more than 2 samples. 

Although fluorescent label is the preferred label, other labels can also be used to achieve 
the same purpose, ttthe lab is isotope with different molecular weights (i.e. P31 vs. P32 vs. P2f3; 
016 vs. 018, etc), primer for sample A may be '^heavier" than primer for sample B. Such 
20 difference may result in the PGR product of different origin to be separated by a detectable 
margin, for example, on mass spectrometry, so that a ratio can be calculated based on these 
closely related peaks. 

In oiie embodunent, the collected cDNA prepared from two or more samples are 
combined and subjected to PGR amplification using two pairs of primers (FIG. 3). Primer 4 is a 

25 conmion primer and it is either identical to primer 2, or identical to the 5 '-end unique sequence 
in primer 2. Primers 3 A aiid 3B are identical to the sample-specific tag sequences incorporated 
into DNA during reverse transcription. In addition each of these primers contains specific 
fluorescent label at the 5*-end of the primer. That will ensxire that the PGR products resulting 
fix>m these two separate samples will be separately labeled by different sample-spdcific 

30 fluorophors, even though PGR is carried out in the same reaction mixture. The primers 3 A and 
3B can represent a mixture of corresponding primers 3A1, 3A2, 3 A3 and 3B1, 3B2,* 3B3 
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identical to the specific sequences Al, A2, A3 and Bl, B2, B3 which were introduced during the 
reverse transcription reaction. Each of these primers may contain a unique fluorescent dye. 

The use of a primer mixture instead of a single primer for each sample will increase the 
number of genes that could be analyzed in a single reaction. Since specific sequences A1-A3 
5 and.BlrB3 are incorporated depending on the nucleotide preceding polyA tail in the mRNA and 
the products of their amplification appear in different fluorescent channels, this method can 
distinguish between DNA firagments that have similar size but differ in the nucleotide preceding 
polyA sequence, 

. .In some embodiments of flie invention, a nested amplification is performed using 
10 amplified products in a preceding amplification reaction as templates. The use of nested PGR 
can also greatly enhance the yield of the species-specific product, therefore the sensitivity of the 
assay, when a single primer pair fails by itself. Preferably, one of the nested PGR primer 
contains a sample-specific sequence so as to keep tracking flie sample origin of the amplified 
product Also preferably, the primer containing the sample-specific sequence is labeled with 
1 5 specific detectable label to permit the detection and analysis of amplified products: For example, 
a method comprising a nested PGR involves two sequential PGR reactions. After multiple cycles 
of PGR (e.g., 10 to 40, or 10 to 30 or 10 to 20 cycles) with the first pair of primers (e.g., with the 
third and the fourth oligonucleotide primers), a small amount aliquot of the first reaction (e.g., 
1 1x1 of a 50^1 reaction) serves as the template for a second multiple cycles of PGR reaction (e.g., 
20 10 to 40, or 10 to 30 or 10 to 20 cycles) with a new set of primers that anneal to sequences 
internal to, or nested between, the first pair. 

Methods for designing nested primers and for performing nested PGR are known in 
art rf^ee Current Protocol in Molecular Biology, supra). The general criteria for selecting primers 
as described above also apply to the design of nested primers. Both nested primers need to 
25 anneal to sequences intemal to (e.g., within) the first pair of primers and at least one of the 
nested primers, however, according to the subject invention, needs to be contain a sample- 
spebific sequence. 

. Separation and Detection of Amplified Products 

During PGR amplification, starting fi:om a predetermined time or cycle (fdr example, the 
30 5* cycle, or the 8* cycle, or the 10^ cycle or other cycle), an aliquot, e.g., between 1% to 40% 
(v/v) of the reaction mixture, is automatically withdrawn after each cycle, and the reaction 
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mixture is replenished with equal volumes of fresh components such as dNTP, fluorescent 

labeled primers and DNA polymerase. The withdrawn sample is then separated and analyzed. 

Methods for detecting the presence or abxmdance of polynucleotides are week known in the art 

and any of them can be used in the subject method of the invention so long as they are capable of 

separates individual polynucleotides although it is preferred that quantitative analysis can be 

performed simultaneously- Useful methods for the separation and analysis of the amplified 

products include, but are not limited to, electrophoresis (e.g., capillary electrophoresis (CE)), 

chromatogn^hy (dHPLC), and mass spectrometry. 

In one embodiment, CE is a preferred separation means since it provides exceptional 
separation of the polynucleotides in the range of at least 10-1,000 base pairs with a resolution of 
a siiigle base pair. CE can be performed by method well known in the art, for example, for 
example, as disclosed in U.S. Patent Nos. 6,217,731; 6,001,230; and 5,963,456, incorporated 
herein by reference. Recently developed throughput CE apparatuses are available commercially, 
for example, the HTS9610 ffigh throughput analysis system and SCE 9610 fully automated 96- 
capillary electrophoresis genetic analysis system from Spectrumedix Corporation (Sta:te College, 
PA); P/ACE 5000 series and CEQ series from Beckman Instruments Inc (Fullerton, CA); and 
ABI PRISM 3100. genetic analyzer (Applied Biosystems, Foster City, CA). Near the end of the 
CE column, the amplified DNA fragments will pass a fluorescent detector which meaisures 
signals of both fluorescent labels. These apparatuses provide automated high throughput for the 
detection of fluorescence-labeled PCR products. 

The employment of CE in the subject method permits higher productivity compared to 
conventional slab gel electrophoresis. The separation speed is limited in slab gel electrophoresis 
because of the heat produced when the high electric field is applied to the gel. Since heat 
elimination is very riapid from the large surface area of a capillary, a higher electric field can be 
appUedtocapiUaryelectrophoresis, thus speeding up the separation process^ By using a 
C£^illary gel, flie separation speed is increased about 10 fold over conventional slab-gel systems. 

With CE, one can analyze multiple samples at the same time which is essential for high- 
tfarougtput. This is achieved by employing multi-capillary systems in one embodiment of the 
invention. However, the detection of fluorescrace from DNA bases may be complicated by the 
scattering of light from the porous matrix and capillary walls. A confocal fluorescence scanner 
maybe used to avoid ligjht scattering (Quesada et al., 1991, Biotechniques 10:616-25). 
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. In one embodiment, the subject method measures how many copies of a particular cDNA 
(i e mRNA) contained in the original sample used as template for PGR ampUfication. To 
determine the nmnber of original copies, the efficiency of the nucleic acid extraction, as well as 
the efficiency of each PGR reaction must be known. Further, the detection step reveals how 
many copies of the target sequence have been made, but not how many copies were contamed m 
the original sample. 

In a preferred embodiment, differences in gene expression, rather than the exact numbers ! 
of copies of the target sequence contained in the sample is measured. The detected fluorescent 
signal strength (e.g.. following GE separation) can be recorded and used to determine the relative 
ratio of each peak from the two samples (FIG. 4). hi a preferred embodiment, cDNAs derived 
from two or more samples are amplified in the same PGR reaction. Each sample is ampUfied by 
a common primer (e:g;, the fourth oligonucleotide primer) and a sample specific primer, 
therefore cDNAs from different samples will compete for the same common primer. Because of 
this competition, the ratio of the amounts of the ampUfied products from two samples reflects tiie 
ratio of the amounts of the initial target polynucleotide in each of the two samples. For example, 
aratio (e.g.. sample A/sample B) of 1 indicates tiiat same initial amount of tiie target 
polynucleotide in the samples A and B. i.e.. that the target polynucleotide is 
expressed in the two samples. Aratioofgreater thanl (e.g.. sample A/sample B) indicates a 
higher amount of the target polynucleotide in sample A than in sample B. A ratio of smaUer than 
1 (e.g.. sample A/sample B) indicates a less amount of flie target polynucleotide in sample A 
than -m sample B. In both of the above cases, the target polynucleotide is differentiaUy expressed 
in the two samples. It is expected that the amount of majority polyniicleotides present in two 
samples (i.e., the expression level of tiiese polynucleotides) are about the same therefore the ratio 
of amplification wii remain constant (e.g., at about 1). 

In another preferred embodiment the signal intensity for each PGR fragment (and 
therefore for each gene) separated by GE WiU be plotted as a fimction of cycle nmnber. TTie 

signal intensity can be represented by total, area of peak on the electrophoregram. A threshold 
cycle number (a) will be calculated as a cycle number at which signal intensity of PGR 
fragment WiU reach a set threshold value (for example 10 standard deviations of backgromid 

) value of signal intensity) for each amplified gene. Operational differential expression of 

particular gene is determined as a difference in threshold cycle number (a) for this gene in two 
(or more) samples more than one cycle in value. The threshold cycle number is further used to 
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derive copy number for each gene and to measxire the difference in the expression by a ratio of 

copy numbers for gene in two or more samples (Fig 5a). 

The method also comprises generating an plot of the rate of signal intensity change as a 
function of number of amplification cycles [derivative of Signal Intensity as a function of cycle 
numer, d(Signal Intensity)/d(cycle number)] for each amplified gene. The alternative threshold 
cycle (aCt), determined as a cycle number corresponding to the maximal value of d(Signal 
hitensity)/d(cycle number) for each amplified gene fi-om one sample, is compared to the aCt for 
the same gene fi:om another sample. Difference in one cycle between aCt values for the same 
gene in two or more samples is defined as alternative operational differential expression (Fig. 
5b). 

Also preferably, the method further comprises collecting PGR fragment or PGR 
firagnients corresponding to one or more genes which display operational differential ejqpression 
or alternative operational differential expression, and identifying the sequence of the one or more 
genes. 

The ratio of a particular polynucleotide in two samples may be fiirther measured against a 
common ratio for determining whether it is differentially expressed between the two samples. 
ITie term "common ratio" as used herein means a relatively constant ratio of all . genes expressed 
between two samples. It reflects a global change (aniount of total startmg material) rather than a 
specific change caused by certain events such as activatioii of a particular signal transduction 
pathway in a treated sample as compared to an untreated sample. By comparing ratio of 
expression of a particular gene with this common ratio, it will be immediately apparent whether 
the expression of that particular gene is different between the samples being compared. 

If the two samples are amplified in separate PGR reactions, an internal control may be 
provided for eiaich PGR amplification and the amplification of each sample is first normalized 
according to internal control before the ratio is calculated. The use of intemal control for 
quantitative PGR is well-knoym in the art, for example, as described in Ausubel et al. There are 
two basic types of control: the first is commonly known as exogenous control (Gilliland et al, 
(1990) PGR Protocols, hmis et al. ed., pp. 60-69, Academic Press; Wang et al. (1989) Proc. Natl. 
Acad. Sci. USA 86:9717-9721, both of which are specifically incorporated hefein4)yj-eference), 
and the second, is known as endogenous control (Dveksler et al. (1992) PGR Methods and 
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AppUcations 6:283-285; Spanalds (1993) Nucleic Acids Research 21 :3809-3819, both of which 
are specifically incorporated herein by reference). 

Exogenous control involves the use of an artificially introduced nucleic acid molecule 
that is added, either to the extraction step or to the PGR step, in a known concentration. The 

5 concept of adding an exogenous nucleic acid at a known concentration in order to act as an 
internal standard for quantitation was mtroduced by Chelly et al. (1988) Nature 333:858-860, 
which is specifically incorporated herem by reference. Therefore, utilizing a control firagment 
fliat is amplified with the same primers as the target sequence more accurately reflects target 
sequence amplification efficiency relative to the intemal standard (see, for example, WO 

10 93/02215; WO 92/11273.; U.S. Patent Nos. 5,213,961 and 5,219,727, all of which are 

incorporated herein by reference). Similar strategies have proven effective for quantitative 
measurement of nucleic- acids utilizing isothermal amplification reactions such as NASBA 
(Kievits et al., 1991, J Virol Methods. 35:273-86) or SDA (Walker, 1994, Nucleic Acids Res. 
22:2670-7). 

1 5 The use of an endogenous control regulates variations in extraction efficiency. Control 

choice is important in that several requirements must be met in order for it to work. The first 
requirement is that the copy niraiber of the control must remain constant; the second. 
Requirement is that the control must amplify with similar efficiency to the sequence bemg 
monitored. Several constitutively expressed genes have been considered as control candidates, 

20 since the expression of these genes is relatively constant over a variety of conditions. Examples 
include, but are not limited to, the p-actin gene, the glyceraldehyde-3-phosphate dehydrogenase 
gene (GAPDH), and the 168 ribosomal RNA gene. While these genes are considered to be. 
constitutively expressed.. . 

Threshold may be set up arbitrarily for the classification of differentially expressed 
25 polynucleotides. For example, a polynucleotide witii a ratio of larger than 1 .2 or less than 0.5 is 
regarded as a differentially expressed polynucleotide (i.e., gene) m the two samples according to 
one embodiment. Polynucleotides identified as differentially expressed can be collected, e.g., by 
a fi-action collector, and the identity of the gene can be established flirough routine DNA 
sequencing. Fraction collectors are corumercially available, for example, firom Bio-Rad 
30 Laboratories (Hercules, CA). " - 
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In another embodiment, since the CE can be calibrated for determining molecular 

weights of eluted PGR fragments, and since the exact sequence used to selectively synthesize the 

second strand cDNA is known, the identity of each PGR fragment of interest can be readily 

deteraiined based on the available genome sequence database information. The human genome 

5 has been completely sequenced, so are a few other organisms such as E. coli, yeast, C. elegan 

and Drosophila. With the fast advancement in DNA sequencing technology, the whole genomes 

of most other organisms of interest will soon completely sequenced! 

One of the unique features of this method of transcription profiling is its ability to 
monitor PGR throughout the entire amplification process. In contrast, existing methods such as 
10 differential display only measure final quantities of PGR products. The advantage of this method 
is that it can detect those changes in gene expression that would otherwise be missed using other 
conventional methbds. This aspect can be illustrated by the typical cur\^es of PGR product 
accumulation (see FIG. 5). 

At the beginning of liie PGR amplification reaction, the amoxmt of PGR product is below 

15 the detection limit of most instruments and no quantitative difference can be observed. For flie 
detection of rare gene transcripts which are normally present, at the level of several copies per 
cell, monitoring PGR products at very late stages will be necessary. Typically, detection of these 
genes will be difficult since the reaction is typically stopped long before those rare transcripts are 
amplified to a detectable level. The middle section ofthe amplification curve, when the signal . 

20 arises above the detection limit and enters a logarithmic phase, constitutes the best signal for 

detecting quantitative differences in gene expression. However, due to the exponential nature of 
the reaction, this phase is relatively short and lasts only a few cycles before the reaction goes into 
a later stationary phase. In this later stationary phase of PGR amplification, accumulation of 
PGR: products are saturated due to several factors such as lack of additional substrates, or lack of 

25 polymerase, or inhibition of polymerase activity by the product, or a combination thereof. 
Obviously, this later stationary phase once again provides little opportunity for detecting 
quantitative differences in gene expression. Therefore, methods that quantify PGR product after 
a predetermined number of cycles can only identify genes that happens to be in the logarithmic 
phase ofthe amplification and would thus miss those genes which are only differentially 

30 detected either earlier or later in the amplification process. ^ 

The instant invention overcomes this limitation since it defines a complete amplification 
curve for each individual amplified fi-agment Moreover, it provides a quantitative basis for 
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measuring expression diflFerences. In the practice of real time quantitative PGR, the 
experimentally defined parameter Ct As used herein, the term "C" refers to the cycle number at 
which the signal generated firom a qxiantitative PGR reaction first rises above a ^threshold", i.e., 
where there is the first reliable detection of amplification of a target nucleic acid sequence. 
5 "Reliable" means that the signal reflects a detectable level of amplified product during PGR. Gt 
generally correlates with starting quantity of an unknown amount of a target nucleic acid, e.g., 
lower amounts of target result in later Gt. Gt is Imked to the initial copy number or conceiitration 
of starting DNA by a simple mathematical equation: 

Log(copy number) = aGt+ b , where a and b are constants. 

10 Therefore, by rheasuring Q for the fragments of the same gene originating from two 

different samples, the original concentration of this gene in thelse samples can be easily 
evaluated. 

The usual source of concerns regarding the use of PGR amplification for expression 
profiling is a potential bias of amplification. Specifically, some sequences are amplified with a 

15 better efficiency than others. This bias can change the final representation of PGR products 
when compared with the starting sample. However, such bias will not affect the instant 
invention because the invention provides an embodiment where amplification of a cDNA target 
from different samples is performed in the same reaction mixture and with the use of a common 
PGR primer. Therefore the ratio of the amplified PGR product originating from different 

20 saniples will only be affected by the ratio of original amount of cDNA in each sample but not by 
the efficiency of amplification. For a given PGR reaction, although amplification of one PGR 
target may still be biasejd against another, this ratio ^hall remain constant without regard to the 
size or flie composition of each PGR product Thus, this mefliod provides an embodiment which 
bypasses such problem by measuring relative, instead of absolute, amplification of two samples 

25 in the same PGR reaction. 

Other potential, problems can arise late in the amplification when availability of DNA 
polymerase may became a limiting factor of amplification. As a consequence, more abundant 
fragments will kinetieally inhibit amplification of less abundant firagments. The importance of 
this problem cannot be empirically predicted since it depends on the sensitivity of the detection 
30 device. One way to alleviate the problem is to gradually increase concentration of the DNA 
polymerase at the late cycles of amplification. ^ 
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. Another method to address the issue of kinetic bias of the PGR is a novel concept 

(normalized amplification or amplification to steady-state). In one embodiment, we propose to 

include additional step in each cycle of amplification starting with cycles 10-20. This step 

consists of treating the amplification mixture with the restriction enzyme directed against the 

5 palindromic sequence included in the primers 2 (FIG. 6), The more abimdant PGR fragments 
will be preferentially digested by the restriction enzyme simply due to their relative abundance. 
The digestion will eliminate a priming site for the DNA polymerase and therefore will prevent 
further amplification of digested firagments. Undigested PGR fragments, which include less 
abundant DNA fragments will continue to amplify, generating 2 copies of each fragment in the 

10 reaction. By adjusting the concentration of the restriction enzyme and the time of this treatment 
it should be possible to regulate this reaction in such way that it will limit fiirther amplification 
of the PGR fiagments after they will reach certain acceptable concentration. To eliminate the 
difference in the size between digested and undigested Segments corresponding to the same 
gene, the aliquot of the reaction mixture will be treated with an excessive amount of the 

1 5 restriction enzyme. Likewise, single-stranded DNA species arising from the priming of the 
opposite strand of DNA could be eliminated by the treatment with the single-stranded DNases 
(for example Exonucleases I and VII). 

The above description is directed to an embodiment that measures differences between 
two origLoal samples, However, it shoiild be understood that more than two samples could also 
20 be used in the same manner as described with minor ad^tation. For example, by using a tiiird 
sample-specific primer and a third fluorescCTit label, the same method can be used for three 
samples- Similarly, even more samples can also be analyzed using a similar approach. 

Kits for Implementing the Method of the Invention 

The invention includes compositions arid kits for carrying out the various embodiments 
25 of the invention. Preferably, kits of the invention include a first oligonucleotide primer, where 
the first oligonucleotide primer comprises a sample-specific sequence tag and where the sample 
specific sequence tag is GC rich at its 5' terminal and AT rich at its 3' terminal. Preferably, the 
first oligonucleotide is attached to a solid support. Additionally, kits of the invention may 
fittther include a second oligonucleotide primer, a third oligonucleotide primer, or a fourth 
30 oligonucleotide primer, where second oligonucleotide primer may comprise an arbitrary 

sequence tag. Kits may further contain one or more components selected from the group of a 
reverse transcriptase, a DNA polymerase, a reaction buffer, and dNTPs. ^.x 
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Exemp lar y Applications of The Present Methods 

1. Research on Development and Signal Transduction Pathways 

Comparing expression profiles of different biological samples are invaluable for studying 
normal developmental processes. 

5 For example, stem cell diflFerehtiation is characterized by a series of specialization into 

stem cells that are committed to give rise to cells that have a particular fimction. Totipotent 
CTibryonic stem cells may partially differentiate into pluripotent stem cells, which in turn may 
give rise to blood stem cells under certain conditions. These committed blood stem cells will 
respond to a host of cjrtokines or "stimulating factors" en route to their further differentiation 

10 into more specialized blood bells such as red blood cells, platelets, and white blood cells. Diuing 
each step of this complicated process, dramatic changes in overall gene expression profiles occur 
in response to particular cytokines. It is of great interest to determine what are the governing 
factors for these kind of fate determination during stem cell differentiation since a partial or 
complete reversion of these steps may be beneficial in regaining some desirable features that are 

1 5 lost during differentiation. Similar approaches may be desirable for a niunber of other 

developmental processes. The instant invention provides a tool to study such changes in gene 
expression profile during development and thus will be of great value for such research. 

2. Therapeutic Uses and Diagnostic Markers 

The instant invention provides a metiiod to compare expression profiles of different 
20 biological samples, which offers an invaluable means to identify potential drug targets for fiirfher 
research and development and useful diagnostic markers for certain pathological conditions. 

There are at least two types of genes which expression profiles may be changed in 
diseased vs. normal samples. Change in expression profile of one type of genes is causally 
related the disease state. It is the up- or down-regulation of these genes that trigger a series of 

25 events that eventually leads to the development of the padiological condition. By modulating the 
activity of these "causal genes,'' it is possible to reverse the disease state and therefore effective 
treat or alleviate the pathological condition. These genes and their products constitute valuable 
dmg targets, the mere identification of which will be beneficial for ttie long term goal of curing 
the disease. Examples of such genes will include, but are not limited to oncogenes (such as Ras) 

30 and tumor suppressor genes (such as Rb, NFl), 
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, The second type of genes which expression projSles are changed are different in that these 

changes are the result rather than the cause of such disease conditions. Although it may not be 

possible to modulate the activity of these genes in the hope that the disease phenotype will be 

reversed, identification of these genes may nevertheless help early accurate diagnosis of such 

5 disease conditions, thereby facilitating early and effective treatment of such conditions. 

Examples of such genes will include but are not limited to tumor antigens CA125, alpha fetal 

protein, etc. 

In addition, the instant invention can also be employed to study the effects of certain 
treatments on cells, tissues or individual. This is useful for basic and pharmaceutical research 

10 when the effect of a potential drug can be studied and/or predicted based on what signal 

transduction pathways are affected by certain treatments. By identifying the potential target of 
such dmg, certain undesirable side effects might be eliminated by fiirther screening for better 
drugs that only affects desired targets while leaving other unintended targets alone. Dmg 
optimization is also possible since the instant invention provides a means to do high throughput 

1 5 screen to identify improved dmgs that causes larger desirable changes in the intended target. 

3. Other Applications 

Another area where a simple method of transcriptional profiling can be extremely 
instrumental is characterization of cells and organisms that underwent genetic modifications (for 
example transgenic animals carrying a modified version of the gene, overexpressing genes or 

20 missing genes (knock-outs)). Such cells and organisms often display an altered transcription 
profile as a result of the modified function of the targeted gene or as a compensating effect. 
Such changes can point out to the function of the gene by placing it to the particular pathway 
dqfinedby the identity of the differentially expressed genes. It may help to define a 
transcriptional signature of alteration in particular genes and to use such signatures to define 

25 genetic modification in a particular disease by comparison of the transcriptional changes in cells 
or tissues obtained from the disease-affected organisms versus a database of transcriptional 
signatures. . 

4. Business methods 

The instant invention also provides a business method to conduct a pharmaceutical 
30 business. Identification and validation of drug targets are important rate limiting steps in tiie 
drug development process. The instant invention provides a method to quii^kly compare the 
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expression profiles of diseased vs. normal tissues, thereby significantly speed up the process of 
identifying potential lead molecules for drug design. The associated high throughput means will 
also help to speed up the dmg screening as well as drag optimization processes. In addition, a 
number of reliable diagnostic markers can be identified and fiuther developed for diagnosis 

5 purpose. This will not only provide a basis for a pharmaceutical business to carry out the 
identification and development of these drag targets or markers, but also an opportunity to 
license the rights of these initial discoveries to a third party so that they can conduct fiirther 
research and development of the target of their choice. Ih addition, it is also possible to offer the 
service of identifying and developing such drag targets or markers using proprietary technology 

10 of the instant invention. 

The instant invention has the potential to become a powerfiil tool for transcriptional 
profiling as a new. platform for genomic discovery. This system has a potential for 
improvements and fiirther development (e.g. increasing number of samples, creation of the band 
ID database eliminating the need for sequencing, etc.). It will also speed up the whole process of 
15 DNA diagnostics (in particular development of low and medium-density microarrays) by 
providing the initial data to select specific sets of genes for down-stream appUcations. 

The practice of the present invention will employ, unless otherwise indicated, 
CQnveritional techniques of molecular biology, ceU biology, cell culture, microbiology and . 
recombinant DNA, which are within the skill of the art. Such techniques are explained fiilly in 
20 the literature. See, for example. Molecular Cloning: A Laboratory Manual, 2nd Ed., ed. by 
Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, 
Volmnes I and n (D.N. Glover ed., 1985); Oligonucleotide Synthesis (M.J. Gait ed., 1984); . 
Mullis et al.; U.S. Patent No: 4,683,195; Polynucleotide Hybridization (B.D. Hames & S.J. 
• : Higgins eds., 19S4); Transcription And Translation (B.T>. Hames & S.J. Higgins eds., 1984); 
25 B. Perbal,^ PracticdVGuide To Molecular Cloning (1984); the treatise. Methods In Enzymology 
(Academic Press, Inc., N.Y.); Methods Jn Enzymology, Vols. 154 and 155 (Wu et al. eds.); 
Immunochemical Methods In Cell And Molecular Biology (Mayer & Walker eds.. Academic . 
Press, London, 1987). The contents of all cited references (including literature references, issued 
. patents, published patent applications as cited througjiout this application) are hereby expressly 
30 incorporated by reference- 
In particular, isolation of total RNA from biological samples and subsequent purification 
of mRNA for cDNA synthesis is well known molecular biology techniques. The details of 
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experimental procedures are well documented in one or more laboratory reference books listed 

above and other scientific literatures. Commercial kits are also widely available for such 

purposes (for example, Qiagen sells kits for mRNA isolation and GIBCO BRL sells kits for 

cDNA synthesis using reverse transcriptase). PGR amplification, chromatography, capillary 

5 electrophoresis (CE) are all routine molecular biology techniques and thus will not be elaborated 

further. 

EXAMPLES 

The invention is illustrated by the following nonlimiting examples wherein the following 
materials and methods are employed. 

10 Example 1 Preparation of RNA 

RNA may be produced using Trizol reagent and RNeasy Midi^axi Kit fi"om Qiagen by 
following the following procedure. 

Tissues were homogenized hi a .homogenizer at 1 ml of TRIZOL reagent per 50-100 mg 
of tissue for 30 seconds, followed by a final homogenization of 1 minute. The sample volume 
15 should not exceed 10% of the volume of Trizol reagent used for homogenization. The 

homogenized tissiies were left for at least 15 minutes up to an hour, at room temperature, or they 
were stored in — 70°C imtil needed. 0.2 ml of chloroform per ml of Trizol was added and mixed 
. by shakiug. The mixture was iucubated at room temperature for 5 minutes, then centrifiiged for 
5 minutes at 4000 rpm. The upper phase was collected into a separate tube. 0.5 ml of isopropyl 
20 . alcohol per ml of Trizol was added to precipitate RNA, The mixture was put on ice for 5 

. minutes and was centrifuged at 4000 rpm for 10 minutes. The supernatant was removed and the 
. pellet was washed with 1 mi 75% EtOH per ml of Trizol, mixed, and centrifiiged at 4000 rpm for 
5 minutest The sup^natant was moved and the pellet was air dried for 30 minutes to 1 hour. 
After pellet was air dried, the pellet was resuspmded in RNAse free water to a desired 
25 concentration. 

The RNA extracted could be cleaned up by adding the appropriate volume of buffer RLT, 
and mix thoroughly. An appropriate volimie of ethanol (96-100%) was added to the diluted 
RNA and mixed thoroughly by shaking vigorously. The sample was applied to an-RNeasy midi 
spin column or RNeasy maxi spin column and was placed in a 15-ml or 50-ml centrifuge tube 
30 and centrifuge for 5 min at 3000-5000 x g. The flow-through was discarded- , 
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Generally, DNase digestion is not required since the RNeasy silica-membrane technology 

efficiently removes most of the DNA without DNase treatment. However, further DNA removal 

may be necessary for certain RNA applications that are sensitive to very small amo\mts of DNA. 

To remove DNA with DNase, pipet 2.0 ml buffer RWl into spin column and centrifuge for 5 

5 minutes at 3000-5000 x g to wash. Discard the flow through and reuse the centrifuge tube. Add 
20|xl DNase 1 stock solution to 140[il buffer RDD. Mix by gently fUcking the tube, and 
centrifuge briefly to collect residual liquid from the sides of the tube. Pipet the DNase 1 
incubation mix (160jal) directly onto the spin column membrane, and place on the benchtop (20- 
30 C) for 15 min. Pipet 2.0 ml RWl buffer into the spin column, and place on the benchtop for 

10 5 min. Then centrifuge for 5 minutes at 3000-5000 x g. Discard tiie flow through. Reuse the 
centrifuge tube m the following RPE buffer wash step. RNeasy kit is used for remainder of 
protocol by foUowing-the manufacturer's manual instruction. 

Example 2. Reverse Transcription in Solution 

The RNA samples (1-5 were mixed witii 1 |il of dNTPs solution (10 mM) and 
1 5 0.0005-0.5 pM (final concentration in 20 iil mixture) of first oligonucleotide, heated for 7 min at 
70°C, cooled for 2 min at 4°C. The above mixture was then mixed with the reaction mixture (4 
pi RT buffer (250 mM Tris-HCl(pH 8.3 at 25X), 375 mM KCl, 15 mMMgCb), 2 pi 0,1 M 
DTT, 1 pi RNAse inhibitor (Ambion) and 1 p,l of SuperScriptll reverse transcriptase (Invitrogen) 
and 5-10 pi of water) in a total volume of 20 pi. The reverse transcription reaction was 
20 incubated for 1-2 hours at 45''C and was terminated by heatmg at 65^*0 for 10 min. An aliquot of. 
sample (5-20 pi) was directly analyzed by PGR. Optionally, the RNA templates were degraded 
by incubation with RNAse H enzyme (Invitrogen) prior to PGR amplification. 

; Example 3. Reverse Transcription on Beads 
a. Goupling of Oligonucleotides to Beads 

25 Ultralink^'^ IodoacetylbeadsOPierce) (100-1000 pi) were washed 

buffer (50 roM Tris, 5 mM EDTA, pH 8.0) and inixed witii the solution of tMolate^ 
oligonucleotide (1-10 pM)- The coupling reaction wais initiated by addition of the reducing 
agait TCEP (Tris(2-carboxyethyl) phosphine ) (100-500 pM) and conducted for 1-2 hours at 
room temperature with continued mixing. The unreacted active groups on the beads were 

30 quenched by addition of 1% of beta mercaptoethanol for 10 min. Oligo-beads were washed 
consequentially with 10 volumes each of 4 times 5xTB buffer, 2 times 5xTE buffer at 75^, 2 
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times RT buffer at 85°C, 2x RT buffer and RT buffer at 95°C. The prepared beads were kept at 

4''C in RT buffer. 

b. Reverse Transcription 

The RNA samples (1-5 fil) were mixed with 1 \xl of dNTPs solution (10 mM) and 6-10 [xl 
5 of oligo-beads, heated for 7 min at 70°C, cooled for 2 min at 4°C and mixed with the reaction 
mixture (4 jal RT buffer (250 mM Tris-HCl (pH 8.3 at 25^C), 375 mM KCl, 15 mM MgCh), 2 ill 
0.1 M DTT, 1 ill RNAse inhibitor (Ambion) and 1 \il of SuperScriptn reverse transcriptase 
(Invitrogen) and 0-4 jil of water). The reverse transcription reaction was incubated for 1-2 hours 
at45**C. The reaction was terminated by heating at 65°C for 10 min. The beads were washed 
10 twice with PGR buffer. RNA templates were destroyed by incubation with RNAse H enzyme 
(Invitrogen) or alkaline hydrolysis. The latter reaction was carried out by addition of 3 .5 [il of 
0.5 M NaOH to the reaction mixture, incubation for 5 min at 65°C and neutralized with 3.5 pi of 
lMTrispH7.5, The beads were washed twice with PGR buffer. 

Example 4. Second Strand Synthesis 

1 5 The synthesis of the second strand of bound DNA was performed with mixture of Taq 

polymerase (Hot-start Taq, Qiagen) (0.5-1. 5u) and Pwo DNA polymerase (Roche) (0.25-0.5u) in 
PGR thennocycler programmed for 30s at 95°G, 30s at 56°C and 2 min at 72''G). The reaction 
mixture included 6- 1 0 |il of cDNA on oUgobeads from RT reaction, 5 }il of 1 Ox PGR buffer 
(Hot-start Taq, Qiagen ) or lOxRT-PCR buffer (500 mMTris, 200 mMKGl, lOOmM 

20 (NH4)2S04, 2.5 mM Mg2Gl, pH 8.5), 0.1 mM dNTPs, 0.5-1 \xl of second primer (100 isM) in a 
total volume of 50 jxL The synthesized second DNA strand was removed fix)m the beads at 96°G 
and used for further amplification. 

Alternatively second strand DNA was synthesized using DNA polymerase 1 or it Klenow 
fragment in the presence of 50 mM Tris pH 7.5, 10 mM Mg GI2 1 mM DTT, and 0.05 mg/ml 
25 BSA, 0. 1 mM dNTP, 7 mM MgGb and 1 pM of second primer for 30 min at 37°G. The . , 
synthesized second DNA strand was rraioved as described above. 

Examples. PGR Amplification: 

PGR amplification of synthesized cDNA (5-20 was amplified in the presence of 10 14,1 
of lOx PGR buffer or lOx RT-PCR buffer (see above), 2-3 mM MgCh, 0.05-0.2 mM dNTPs, 

.J 

30 0.1-1 pM of third primers labeled with either FAM, Rox, or Hex, 0-1 \xM of unlabeled third 
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primers, 1-2 \il of fourth primer, 0-5 % DMSO, 0.5-2 u of proofreading DNA polymerase (Pwo 
or Tgo (Roche)) and 1.5-3 u of hot-start DNA polymerase (e.g. Hot-Start Taq polymerase 
(Qiagen)). The amplification was conducted using "I-cycler*' (BioRad) or "PGR Express" 
(Thermo Hybaid) thermocyclCTS using following cycling program: 95**C for 30s, 56-60**C for 15- 
5 30*^0, 72^*0 for 1 min 30 sec for 30-40 cycles total. Aliqouts of samples (typically 25 ^l) were 
withdrawn after each cy^^le at the end of extension step (72°C) starting with 10-15* cycle. Equal 
volume of PGR mix containing primers, polymerase and dNTPs was placed into reaction mix 
after each sample removal. The collected samples were analyzed \ising GE system from 
■ Spectrumedix (SGE-9610 Genetic Analysis System) or ABI (3700 Prism System). 

10 Example 6 Gapillary Electrophoresis 

Gapillary electrophoresis was performed on SGE 9610 fiilly automated 96-'CapilIary 
electrophoresis genetic analysis system ifrom Spectrumedix Gorporation according to the 
manufacture's instraction, 

OTHER EMBODIMENTS 

15 The foregoing embodiments demoiistrate experiments perfonned and techniques 

contemplated by the present inventors in making and carrying out the mvention. It is believed 
that these embodiments include a disclosure of techniques which serve to both apprise the art of 
the practice of the invention and to demonstrate its useftihiess. It will be appreciated by those of 
skill in the art that the techniques and embodiments disclosed herein are preferred embodiinentis 

20 only that in general numerous equivalent methods and techniques may be employed to achieve 
the same result. 

AU of the referraces identified hereinabove/ are hereby expressly 
reference in their mtirety. 
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CLAIMS 

1 . A method for comparing gene expression profiles of two or more samples, said 
me&od comprising: 

(a) synthesizing a plurality of first strand cDNAs from a first sample using a 
5 first oligonucleotide primer comprising a sample-specific sequence tag, wherein said sample- 
specific sequence tag is GC rich at its 5' terminal and At rich at its 3'* terminal; 

(b) selectively amplifying at least a subset of said cDNA so as to generate one 
or more sample-specific amplified products; 

(c) detecting the abimdance of one or more said sample-specific amplified 

1 0 products, wherein said abundance determines an expression profile of one or more genes in said 
first sample; and 

(d) comparing the expression profile of said one or more genes in said first 
sample with an expression profile of said one or more genes in a second sample, wherein a 
difference in the expression profile indicates differential expression of said one or more genes in 

15 the two samples. 

2. The method of claim 1, wherein said step (a) comprises reverse transcribing RNA 
from two or more sample sources into first strand cDNA, and wherein said cDNA is 
dififerra^tially tagged according to their sources. 

3 . The method of claim 1 , wherein said plurality of first strand cDNAs is 

20 synthesized by reverse transcription using total RNAs or mRNAs derived from said first sample. 

4. The method of claim 1, wherein a third oligonucleotide primer comprising said 
sequence-specific sequence tag of said first oligonucleotide primer is used for said amplifying so 
as to generate one or more sample-specific amplified products. 

5. The method of claim 1, wherein at least one of said two or moris samples is 
25 derived form the group consisting of: a normal sample, a disease sample, a sample at a given 

development stage or condition, a sample prior to a given treatment stage or condition, a sample 
after a given treatment stage or condition, and a sample at a given culturing stage or condition. 
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6. The method of claim 1 , wherein at least one of said two or more samples is 

derived from the group consisting of: an animal, an organ, a tissue type, and a cell type. 

7. The method of claim 1, wherein said sample-specific sequence in said first 
oUgonucleotide primer is 15-30 nucleotides in length. 

5 8. The method of claim 1 , wherein said sample-specific sequence is 20-24 

nucleotide in length. 

9. The method of claim 1 , wherein said first oligonucleotide primer further 
comprises a sequence of 5' oligo(dT)nVN 3', where n is at least 5; V is dATP, dGTP, or dCTP; 
and N is dTTP (or dUTP), dATP, dGTP, or dCTP. 

10 10. The method of claim 1, wherein said first oligonucleotide primer is provided as a 

mixture of primers comprismg [5'-(specific sequence tag)2o-24Ti2.i6AN-3', 5'-(specific sequence 
tag)2o.24Ti2-i6CN-3% and 5"-(specific sequence tag)2o,24Ti2.i6GN-3'], wherein said specific 
sequence tags are identical or different for each primer in said mixture. 

11. The method of claim 10, wherein n is 12-1 6. 

15 12. The method of claim 10, wherein in said first oligonucleotide primer, said 

sample-specific sequence tag is located at the 5' of oUgo(dT)nVN. 

13. The method of claim 1, fiirfher comprising synthesizing one or more second 
strand cDNAs complementary to said first strand cDNAs using a second oligonucleotide primer 
comprising a first arbitrary sequence tag, wherein step (b) amplifies at least a subset of said 

20 second strand cDNAs^so as to generate one or more sample-specific amplified products. 

14. The mtethod of claim 13, wherein said second oligonucleotide primer further 
comprises a second sequence which is complementary to a subset of said first strand cDNAs so 
as to permit flie syntihesis of one or more second strand cDNAs. 

15. The method of claim 1 4, wherein in said second oligonucleotide primer, said 
25 second sequence is located 3' of said first arbitrary sequence. 

16. The method of claim 14, wherein said second oligonucleotide further comprises a 
sequence of (Z)m between said first and second sequences, where Z is a nucleotide? which can 
form base pair with any of A, T, G; or C, and m is at least 2. . / 
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17. The method of claim 16, wherein m is 4. 

18. The method of claim 14, wherein said second sequence is 5-10 nucleotides in 

length. 

19. The method of claim 18, wherein said second sequence is 6-7 nucleotides in 

5 length. 

20. The method of claim 13, wherein said first arbitrary sequence within said second 
oligonucleotide primer is 15-30 nucleotides in length. 

21. The method of claim 13, wherein said first arbitrary sequence within said second 
oligonucleotide primer comprises a A-T rich region and a G-C rich region, 

10 22. The method of claim 21, wherein said G-C rich region is located at 5' of said A-T 

rich region. 

23. The method of claim 13, wherein said second oligonucleotide primer used is the 
same for said two or more samples to be compared. 

24. The method of 4, wherein said amplifying further comprises using a fourth 
1 5 oligonucleotide primer which comprises said first arbitrary sequence tag of said second 

oligonucleotide primer. 

25. The method ofclaim 24, wherein said fourth oligonucleotide primer used is the 
same for said two or more samples to be compared. 

26. The method of claim 14, whereiii said second sequence within said second : 
IZO oUgoniicleotide primer is gene-family-specific. 

27. The method of claim 14, wherein said second sequence within said second 
oligonucleotide primer is a' sequence encoding a peptide specific for a protein family. 

28. The method of claim 27, wherein said second sequence comprises a sequence 
encoding a signature sequence motif for a specific protein fainily. 

25 29. The method of claim 28, wherein said protein family is selected fcom the group 

consisting of: receptor tyrosine kinases, G protein coupled receptors, seven transmembrane 

r" 

J. 
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receptors, ion channels, cytokine receptors, tumor markers, MAPK cascade kinases, 
transcriptional factors, GTPases, ATPases, and development protein markers. 

30. The method of claim 1, wherein said first strand cDNA is synthesized in a 
solution without attaching to a solid support. 

5 31. The method of claim 1 , wherein said first strand cDNA is synthesized attaching to 

a solid support. 

32. The method of claim 3 1 , wherein said solid support is a microparticle or an inner 
wall of a reaction tube. 

33. The method of claim 13, wherein said method further comprises separating said 
10 one or more second strand cDNA firom said plurality of first strand cDNA before amplifying said 

one or more second strand cDNAs. 

34. The method of claim 4, wherein said third oligonucleotide primer is linked to a 
detectable label. 

35 . The method of claim 34, wherem said detectable label is selected from a group 
15 consisting of: fluorescent labels, radioactive labels, colorimetrical labels, magnetic labels, and 

enzymaticlabels. 

36. The method of claim 35, wherein said detectable label is a fluorescent label. 

37. The method of claim 34, wherem said third oligonucleotide primer used for each 
of said two or more samples is labeled with a sample-specific label. 

20 38! The method of claim 1 ; wherein said one or more ainplified products are sampled 

. at a predeteraiined time or cycle interval during the amplification, 

39. The method of claim 38, wherein the abxmdance is detected for each sampled 
amplified product. 

40. The method of 1, wherein said method further comprises separating said one or 
25 more amplified products before detecting the abundance of said one or more amplified products. 



41, The method of claim 40, wherein said one or more amplified products are. 
separated and their abundance detected by chromatography. 
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42 The method of claim 40, wherein said one or more amplified products are 

separated and their abundance detected by measurement of fluorescence. 

43. The method of claim 40, wherein said one or more amplified products are separated 
and their abundance detected by measurement of optical density. 

5 44. The method of claim 40, wherein said one or more amplified products are 

separated and their abundance detected by mass spectrometry. 

45 . The method of claim 40, wherein said one or more amplified products are 
separated by electrophoresis. 

46- The method of claim 45, wherein said one or more amplified products are 
10 separated by capillary electrophoresis. 

47. The method of claim 1, wherein said difference in the expression profile of said 
one or more genes is measured by a ratio of saint)le-specific detectable labels on amplified 
products firom said genes between two or more samples. 

48. The method of claim 1, wherein said method further comprises generating an 
15 amplification plot, calculating a Ct of amplification for each of said one or more genes, and 

measuring the difference in the expression profile by a ratio of said Cts. 

49. The metiibd of claim 1, wherein said method fiirflier comprises collecting one or 
more genes which are differentially expressed and identifying the sequence identities of said one 
or more genes by DNA sequencing. 

20 50. The method of claim 1, wherein sddampli^r^ 

51. A method for comparing gene expression profiles of two or more samples, said 
method comprising: 

(a) synthesizing a plurality of first strand cDNAs fi-om a first sample using a 
first oligonucleotide primer comprising a sample-specific sequence tag, wherein said first . 

25 oligonucleotide primer comprises at least one degenerate nucleotide; . 

(b) selectively amplifying at least a subset of said cDNA so as to generate one 
or more sample-specific amplified products; / 
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(c) detecting the abundance of one or more said sample-specific amplified 
products, wherein said abundance determines an expression profile of one or more genes in said 
first sample; and 

(d) comparing the expression profile of said one or more genes in said first 
5 sample with an expression profile of said one or more genes in a second sample, wherein a 

difference in the expression profile indicates differential expression of said one or more genes in 
the two samples. 

52. A method for comparing gene expression profiles of two or more samples, said 
method comprising: 

10 (a) synthesizing a plurality of first strand cDNAs firom a first sample using a 

first oUgonucleotide primer comprising a sample-specific sequence tag, wherein said sample- 
specific sequence tag comprises at least one artificial nucleotide; 

(b) selectively amplifying at le£ist a subset of said cDNA so as to generate one 
or more sample-specific amplified products; 

15 (c) detecting the abimdance of one or more said sample-specific amplified 

products, wherein said abundance determines an expression profile of one or more genes in said 
first sample; and 

(d) comparing the expression profile of said one or more genes in said first 
sample with an expression profile of said one or more genes in a second sample, wherein a 
20 difference in the expression profile indicates differential expression of said one or more genes in 
the two. samples. j 

53. . A miethod for comparing gene expression.profiles of two or more samples, said 
method comprising: 

. (a) ^synthesizing a plurality of first strand cDNAs firom a first sample using a 
25 first oligonucleotide primer comprising a sample-specific sequence tag, wherein said sample- 
specific sequoDLce tag is GC rich at its 5' terminal and AT rich at its 3 ' terminal; 
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(b) selectively synthesizing one or more second strand cDNAs 

complementary to said first strand cDNAs using a second oligonucleotide primer comprising a 
first arbitrary sequence tag; 

(c) amplifying said one or more second strand cDNA so as to generate one or 
5 more sample-specific amplified products; 

(d) detecting the abundance of one or more said sample-specific amplified 
products, wherein said abundance determines an expression profile of one or more genes in said 
first sample; and 

(e) comparing the expression profile of said one or more genes in said first 
10 sample with an expression profile of said one or more genes in a second sample, wherein a 

difference in the expression profile indicates differential expression of said one or more genes in 
the two samples. 

54. A method for comparing gene expression profiles of two or more samples, said 
method comprising: 

15 (a) synthesizing a pluraUtyoffirst strand cDNAsfirom a first samp 

first oligonucleotide primer comprising a sample-specific sequence tag, wherein said first 
oligonucleotide primer comprises at least one degenerate nucleotide; 

(b) selectively synthesizing one or more second strand cDNAs 
complementary to said first strand cDNAs using a second oligonucleotide primer comprising a 
20 first a^rbitrary sequence tag; 

' (c) amphfying said one or more second strand cDNA so as to generate one or 
more sample-specific amplified products; 

(d) detecting the abundance of one or more said sample-specific amplified 
products, wherein said abundance determines an expression profile of one or more genes in said 

25 first sample; and 

(e) comparing ttie expression profile of said one or more genes in said first 
sample with an expression profile of said one or more genes in a second sample, wherein a 
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difference in the expression profile indicates differential expression of said one or more genes in 
the two samples. 

55. A method for comparing gene expression profiles of two or more samples, said 
method comprising: 

5 (a) synthesizing a plurality of first strand cDNAs fi"om a first sample using a 

first oligonucleotide primer comprising a sample-specific sequence tag, wherein said sample- 
specific sequence tag comprises at least one artificial nucleotide; 

(b) selectively synthesizing one or more second strand cDNAs 
complementary to said first strand cDNAs using a second oligonucleotide primer comprising a 

10. . first arbitrary sequence tag; 

(c) amplifying said one or more second strand cDNA so as to gCTierate one or 
more sample-specific amplified products; 

(d) detecting the abxmdance of one or more said sample-specific amplified 
products, wherein said abundance determines an expression profile of one or more genes in said 

15 first sample; and 

(e) comparing the expression profile of said one or more genes in said first 
sample with an expression profile of said one or more genes in a second sample, wherein a 
difference in the expression profile indicates differential expression of said one or more genes in 
the two samples. 

20 56. A method of identifying a modulator which regulates one or more gene 

" e3q)ression in a sample; said method comprising: 

(a) synthesizing a plurality of first strand cDNAs, before contacting said 
sample with said modulator, using a first oligonucleotide primer comprising a sample-specific 
sequence tag, wherein said sample-ispecific sequence tag is GC rich at its 5' terminal and At rich 

25 at its 3' terminal; 

(b) selectively amplifying at least a subset of said cDNA so as to generate one 
or more sample-specific amplified products; - 
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(c) detecting the abundance of one or more said sample-specific amplified 
products, wherein said abundance determines an expression profile of one or more genes in said 
sample; and 

(d) comparing the expression profile of said one or more genes in said sample 
5 before contacting with said modtilator with an expression profile of said one or more genes in 

said sample after contacting said modulator, wherein a difference in the expression profile 
indicates said modulator regulating one or more gene expression in said sample. 

57. A method of identifying a modulator which regulates one or more gene 
expression in a sample, said method comprising: 

10 (a) synthesizing a plurality of first strand cDNAs, before contacting said 

sample with said modulator, using a first oligonucleotide primer comprising a sample-specific 
sequence tag, wherein said first oligonucleotide primer comprises at least one degenerate 
nucleotide; 

(b) selectively amplifying at least a subset of said cDNA so as to generate one 
15 or more sample-specific amplified products; 

(c) detecting the abundance of one or more said sample-specific amplified 
products, wherein said abundance determines an expression profile of one or more genes in said 
sample; and 

(d) comparing the expression profile of said one or more genes in said sample 
20 before contacting with said modulator with an expression profile of said one or more genes in 

said sample after contacting said modulator, wherein a difference in the expression profile 
indicates said modulator regulating one or more gene expression in said sample. 

58. A niethod of identifying a modulator which regulates one or more gene 
expression in a sample, said method comprising: 

25 (a) synthesizing a plurality of first strand cDNAs, before contacting said 

sample with said modulator, using a first oligonucleotide primer comprising a sample-specific 
sequence tag, wherein said sample-specific sequence tag is GC rich at its 5' temiinal and At rich 
at its 3' terminal; 
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(b) synthesizing one or more second strand cDNAs using a second 

olig6nucleotide primer comprising a first arbitrary sequence tag; 

(c) amplifying said second strand cDNAs so as to generate one or more 
sample-specific amplified products; 

5 (d) detecting the abundance of one or more said sample-specific amplified 

products, wherein said abundance detemiines an OTqpression profile of one or more genes in said 
sample; and 

(e) comparing the expression profile of said one or more genes in said sample 
before contacting with said modulator with an expression profile of said one or more genes in 
10 . said sample after contacting said modulator, wherein a difference in the expression profile 
indicates said modulator regulating one or more gene expression in said sample. 

59. A method of identifying a modulator which regulates one or more gene 
expression in a sample, said method comprising: 

(a) synthesizing a plurality of first strand cDNAs, before contacting said 
15 sample with said modulator, using a first ohgonucleotide primer comprising a sample-specific 

sequence tag, wherein said first oligonucleotide primer comprises at least one degenerate 
nucleotide; 

(b) synthesizing one or more second strand cDNAs using a second 
oligonucleotide primer comprising a first arbitrary sequence tag; 

20 (c) amplifying said second strand cDNAs so as to generate one or more 

•sample-specific amplifi:ed products; 

(d) detectingtheabundanceof one or more said sample-specific amplified 
products, wherein said abundance determines an expression profile of one or more genes in said 
sample; and 

25 (e) comparing the expression profile of said one or more genes in said sample 

before contacting with said modulator with an expression profile of said one or more genes in 
said sample after contacting said modulator, wherein a difference in the expressibn profiOle 
indicates said modulator regulating one or more gene expression in said sample. 
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60. A composition for detecting the level of gene expression, comprising a first 
oligonucleotide primer, wherein said first oligonucleotide primer comprises a sample-specific 
sequence tag and wherein said sample-specific sequence tag is GC rich at its 5' terminal and AT 
rich at its 3 ' terminal, 

5 61. The composition of claim 60, further comprising a second oligonucleotide primer 

which comprises a first arbitrary sequence tag. 

62. ITie composition of claim 61, further comprising a third oligonucleotide primer 
comprising said sequence-specific sequence tag of said first oligonucleotide primer. 

63. The composition of claim 62, further comprising a fourth oligonucleotide primer 
10 which comprises said first arbitrary sequence tag. 

64. The composition of claim 60, wherein said second primer further comprises a 
second sequence which is complementary to a sequence of said first strand cDNA. 

65. The composition of claim 60, further comprising one or more components 
selected firom the group of: a reverse transcriptase, a DNA polymerase, a reaction buffer for said 

15 • reverse transcriptase, a reaction buffer for said DNA polymerase, and dNTPs. 

66. A composition for detecting the level of gene expression, comprising a first 
oUgonucleotide primer, wherein said first oligonucleotide primer comprises a sample-specific 
sequence tag and wherein S2iid first oligonucleotide primer comprises at least one degenerate 
nucleotide. 

20 67. A kit for detecting the level of gene expression, comprising a first oligonucleotide 

primer, wherein said first oligonucleotide primer comprises a sample-specific sequence tajg and 
wherein said sample-specific sequence tag is GC rich at its 5' terminal and AT rich at its 3 V 
temiinal, and packaging material thereof. 

68. The kit of claim 67, further comprising a second oligonucleotide primer which 
25 comprises a first arbitrary sequence tag. 

69. The kit of claim 67, further comprising a third oUgonucleotide primer comprising 
said sequence-specific sequence tag of said first oligonucleotide primer. 
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70. The kit of claim 68, further comprising a fourth oligonucleotide primer which 
comprises said first arbitrary sequence tag. 

71, The composition of claim 68, wherein said second primer further comprises a 
second sequence which is complementary to a sequence of said first strand cDNA. 

5 72. The composition of claim 67, further comprising one or more components 

selected firom the group of: a reverse transcriptase, a DNA polymerase, a reaction buffer for said 
reverse transcriptase, a reaction buffer for said DNA polymerase, and dNTPs. 

73. A kit for detecting the level of gene expression, comprising a first oligonucleotide 
primer, wherein said first oligonucleotide primer comprises a sample-specific sequence tag and 
10 wherein said first oligonucleotide primer comprises at least one degenerate nucleotide, and 
packaging material thereof 
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FIGURE 1 
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mRNAs 



Primer 1 A 



Primers lA and IB: 

3'«NV(dT)i2.i6(tinique sequence A)2o.24-5' 
3'-NV(dT)i2.i6(unique sequence 8)20-24-5' 

V=»A,C.G N=A.T.C,G 



DNAs 



Reverse 
transcription 
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FIGURE 2 



DNAs n n 

-B □ 



Primer 2: 

A) (^1^-1 - --qy^n^^ ^-Kr i.(Z)x.,(XXXXX)s-io 
X=A,T,G,C 

Z= artificial base,.p.airing with all four nucleotides 

B) cii uic»^*? iH;4|M«:.oc:e £;^,.^3s(XXXXX)s.io 
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FIGURE 6 



I 



3'-end of primer contains 
recognition site for restriction 
enzyme 



Restriction enzyme 



Restriction enzyme removes the 
region complimentary to primer 




Forward primer cannot anneal to 
digested fragment and cannot 
extend complimentary strand 




Amplification stops 
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Sample A 
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FIGURE? 
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Sample B 




mRNA preparation 
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polyA 



First cDNA strand synthesis using a first primer (1 A or IB) comprising a sample- 
specific-sequence tag (Tag A or Tag B): (Tag A or Tag 0)20.24 (dT)i2.i6 VN 
Preferably Tag A or B would be GC rich at 5* and AT rich at 3'(e.g^ >70% GCs or ATs); 
Also preferably Tag A or B would comprise artificial nucleotides. 
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polyA 



Primer lA 



polyA 



'^^^'-^ Primer IB 
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Primer 1 A 



5!- 



polyA 



Primer IB 



Second cDNA strand synthesis using a common second primer (2) comprising a first arbitrary sequence 
tag X and a second arbitrary sequence tag Y. There may be one or more degenerate bases (Z) between 
tag X and tag Y. Tag Y allows the generation of second strand cDNAs for a specific set of genes: 

(TagX)io.ii(Z)4(TagY)„^ 
Preferably Tag X would be GC rich at 5' and AT rich at 3'(e.g.» >70% GCs or ATs); Also preferably Tag X would 
comprise artificial nucleotides. 
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%>^Z)r™B — - 

%^ 



J' 
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Second strand cDNAl 



^may be 



separated 



PGR amplification using primers 3 and 4: 

- Primer 4 comprises the sequence of TagX and is common for both samples; 

- Primer 3 comprises the sequence of either tag A or Tag B (sample-specific); 
and 

- Primer 3 is differentially labeled (e.g^ by fluorescent labels). 

The amplified PGR products would be differentiaUy labeled according to their 
source. 

The abundance of an amplified PGR product is resolved by capillary 
electrophoresis after each amplification cycle. 

The abundance of the amplified PGR product is compared between the two 
samples. 
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